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Description 

IDENTIFICATION OF NOVEL MS4A GENE FAMILY MEMBERS 
EXPRESSED BY HEMATOPOIETIC CELLS 



5 



Cross Reference to Related Applications 



This application is based on and claims priority to United States 
Provisional Application Serial Number 60/254,362, filed December 8, 2000. 
and United States Provisional Application Serial No. 60/270.057 filed 
February 20, 2001 , herein incorporated by reference in their entirety. 



This work was supported by NIH grants CA-81776 and CA-54464. 
Thus, the U.S. Government has rights in the invention. 



The present invention generally relates to a new class of MS4A 
proteins characterized by a membrane-embedded structure. More 
particularly, the present invention provides MS4A nucleic acid and 
polypeptide sequences, chimeric genes comprising disclosed MS4A 
20 sequences, antibodies that specifically recognize MS4A polypeptides, and 
uses thereof. 
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American Tissue Culture Collection 
CD20 B lymphocyte differentiation antigen 
high-affinity IgE receptor B chain 
green fluorescent protein 
GenBank human genomic database 
hematopoietic CD20-like antigen 
membrane spanning 4-domain family, 
subfamily A 
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Background Art 

CD20, FceRIp, and HTm4 are three cell surface proteins expressed 
by hematopoietic cells that represent members of a nascent gene family 
• (Adra et al. (1999) Clin Genet 55:431-437, Kinet (1999) Annu Rev Immunol 

5 17:931-972; Tedder and Engel (1994) Immunol Today 15:450-454). The 
deduced amino acid sequence of human and mouse CD20 first 
demonstrated a cell surface protein containing four membrane-spanning 
regions, N- and C-terminal cytoplasmic domains, and an -50 amino acid 
loop that serves as the extracellular domain (Einfeld et al. (1988) EMBO J 

10 7:711-717; Stamenkovic and Seed (1988) J Exp Med 167:1975-1980; 
Tedder et al. (1988a) J Immunol 141:4388-4394; Tedder et al. (1988b) Proc 
Natl Acad Sci USA 85:208-212). Human CD20 shares 20% amino acid 
sequence identity with FceRip and HTm4 (Adra et al. (1994) Proc Natl Acad 
Sci USA 91:10178-10182, Kuster et al. (1992) J Biol Chem 267:12782- 

15 12787). Moreover, these three proteins have a similar overall structure in 
man, mouse, and rat with significant sequence identity within the first three 
membrane-spanning domains (Kinet et al. (1988) Proc Natl Acad Sci USA 
85:6483-6487; Ra et al. (1989) Nature 19:1771-7; Tedder et al., 1988a). In 
addition, all three genes are located in the same region of human 

20 chromosome 11q12-13.1 (Adra et al., 1994; Hupp et al. (1989) J Immunol 
143:3787-3791; Tedder et al. (1989a) J Immunol 142:2555-2559) and 
mouse chromosome 19 (Hupp et al. 1989; Tedder et al., 1988a). These 
three genes are therefore likely to have evolved from a common precursor. 
Despite structural and sequence conservation between CD20, FcsRip 

25 and HTm4, transcription of each gene is differentially regulated. CD20 is 
only expressed by B lymphocytes (Stashenko et al. (1980) J Immunol 
125:1678-1685; Tedder et al., 1988a). FcsRip is expressed by mast cells 
and basophils (Kinet, 1999). HTm4 is expressed by diverse lymphoid and 
myeloid origin hematopoietic cells (Adra et al., 1994). 

30 Although the function of HTm4 remains unexplored, CD20 and FceRIp 

have critical roles in cell signaling. CD20 forms a homo- or hetero-tetrameric 
complex that is functionally important for regulating cell cycle progression 
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and signal transduction in B lymphocytes (Tedder and Engel, 1994). CD20 
additionally regulates transmembrane Ca ++ conductance, possibly as a 
functional component of a Ca ++ -permeable cation channel (Bubien et al. J 
Cell Biol 121:1121-1132; Kanzaki et al. (1997a) J Biol Chem 272:14733- 
5 14739; Kanzaki et al. (1997b) J Biol Chem 272:4964-4969; Kanzaki et al. 

(1995) J Biol Chem 270:13099-13104). FceRip is part of a tetrameric 
receptor complex consisting of a, p, and two y chains (Blank et al. (1989) 
Nature 337:187-189). FceRip mediates interactions with IgE-bound antigens 
that lead to cellular responses such as the degranulation of mast cells. 

10 Specifically, the FceRip subunit functions as an amplifier of FceRlp-mediated 
activation signals (Dombrowicz et al. (1998) Immunity 8:517-529; Lin et al. 

(1996) Cell 85:985-995). Because of their unique structure and sequence 
homology, CD20, FceRip, and HTm4 are likely to share overlapping 
functional properties. 

15 CD20 and FceRip are also important clinically. Antibodies against 

CD20 are effective in treating non-Hodgkin's lymphoma (McLaughlin et al. 
(1998) Oncology 12:1763-1769; Onrust et al. (1989) J Biol Chem 
264:15323-15327; Weiner (1999) Semin Oncol 26:43-51). Genetic 
variations at chromosome 11q12-13 can also play a role in the pathogenesis 

20 of allergic diseases (Adra et al., 1999; Kinet, 1999). Recent studies suggest 
that FceRip contributes to such diseases, and other genetic elements in this 
region likely also contribute to allergic disease. 

Since CD20, FceRip, and HTm4 are likely to have evolved by 
duplication of an ancestral gene, other related proteins might exist that form 

25 additional receptor complexes. In view of the clinical importance noted 
above, the identification of such proteins thus represents a long-felt and 
ongoing need in the art. To address this need, applicants have identified 
novel human and mouse proteins that span the cell membrane at least four 
times and share high levels of amino acid sequence identity with CD20, 

30 FceRip, and HTm4. This finding reveals a new gene family that has been 
designated herein as the MS4A family (membrane spanning 4-domain 
family, subfamily A). Currently this family contains at least 10 subgroups 
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(MS4A1 through MS4A12) that encode at least 21 previously unidentified 
human and mouse proteins expressed by hematopoietic cells and by diverse 
cell types in non-hematopoietic tissues. 

5 Summary of the Invention 

The present invention discloses isolated MS4A polypeptides and 
isolated nucleic acid molecules encoding the same. Preferably, an isolated 
MS4A polypeptide, or functional portion thereof, comprises a polypeptide 
encoded by the nucleic acid molecule of any one of the odd numbered SEQ 

10 ID NOs:1-37 a polypeptide encoded by a nucleic acid molecule that is 
substantially identical to any one of the odd-numbered SEQ ID NOs:1-37, a 
polypeptide fragment encoded by a 20 nucleotide sequence that is identical 
to a contiguous 20 nucleotide sequence of any one of the odd-numbered 
. SEQ ID NOs:1-37, a polypeptide having an amino acid sequence of any one 

15 of the even-numbered SEQ ID NOs:2-38, a polypeptide that is a biological 
equivalent of any one of the even-numbered SEQ ID NOs:2-38, or a 
polypeptide that is immunologically cross-reactive with an antibody that 
shows specific binding with a polypeptide comprising some or all amino 
acids of any one of the even-numbered SEQ ID NOs:2-38. 

20 The present invention further teaches chimeric genes having a 

heterologous promoter that drives expression of a nucleic acid sequence 
encoding a MS4A polypeptide. Preferably, the chimeric gene is carried in a 
vector and introduced into a host cell so that a MS4A polypeptide of the 
present invention is produced. Preferred host cells include but are not 

25 limited to a bacterial cell, a hamster cell, a mouse cell, or a human cell. 

In another aspect of the invention, a method is provided for detecting 
a nucleic acid molecule that encodes a MS4A polypeptide. According to the 
method, a biological sample having nucleic acid material is hybridized under 
stringent hybridization conditions to a MS4A nucleic acid molecule of the 

30 present invention. Such hybridization enables a nucleic acid molecule of the 
biological sample and the MS4A nucleic acid molecule to form a detectable 
duplex structure. Preferably, the MS4A nucleic acid molecule includes some 
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or all nucleotides of any one of the odd-numbered SEQ ID NOs:1-37. Also 
preferably, the biological sample comprises human nucleic acid material. 

The present invention further teaches an antibody that specifically 
recognizes a MS4A polypeptide. Preferably, the antibody recognizes some 
5 or all amino acids of any one of the even-numbered SEQ ID NOs:2-38. A 
method for producing a MS4A antibody is also disclosed, and the method 
comprises recombinantly or synthetically producing a MS4A polypeptide, or 
portion thereof; formulating the MS4A polypeptide so that it is an effective 
immunogen; immunizing an animal with the formulated polypeptide to 

10 generate an immune response that includes production of MS4A antibodies; 
and collecting blood serum from the immunized animal containing antibodies 
that specifically recognize a MS4A polypeptide. Antibody-producing cells 
can be optionally fused with an immortal cell line whereby a monoclonal 
antibody that specifically recognizes a MS4A polypeptide can be selected. 

15 Preferably, the MS4A polypeptide used as an immunogen includes some or 
all amino acid sequences of any one the even-numbered SEQ ID NOs:2-38. 

A method is also provided for detecting a level of MS4A polypeptide 
using an antibody that specifically recognizes a MS4A polypeptide. 
According to the method, a biological sample is obtained from an 

20 experimental subject and a control subject, and a MS4A polypeptide is 
detected in the sample by immunochemical reaction with the MS4A 
antibody. Preferably, the antibody recognizes amino acids of any one of the 
even-numbered SEQ ID NOs:2-38, and is prepared according to a method 
of the present invention for producing such an antibody. 

25 The present invention further discloses a method for identifying a 

compound that modulates MS4A function. The method comprises: exposing 
an isolated MS4A polypeptide to one or more compounds, and assaying 
binding of a compound to the isolated MS4A polypeptide. A compound is 
selected that demonstrates specific binding to the isolated MS4A 

30 polypeptide. Preferably, the MS4A polypeptide used in the binding assay of 
the method includes some or all amino acids of any one of the even- 
numbered SEQ ID NOs:2-38. 
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Also provided is a method for identifying a regulator of MS4A gene 
expression. The method comprises (a) exposing a cell sample with a 
candidate compound to be tested, the cell sample containing at least one 
cell containing a DNA construct comprising a modulatable transcriptional 

5 regulatory sequence of a MS4A-encoding nucleic acid and a reporter gene 
which is capable of producing a detectable signal; (b) evaluating an amount 
of signal produced in relation to a control sample; and (c) identifying a 
candidate compound as a modulator of MS4A gene expression based on the 
amount of signal produced in relation to a control sample. Preferably, the 

10 modulatable transcriptional regulatory sequence of a MS4A-encoding 
nucleic acid comprises a sequence that is immediately upstream of the initial 
coding region of a MS4A gene as set forth in any one of SEQ ID NOs:73-81 . 

The present invention further provides a method for modulating MS4A 
function in a subject. According to the method, a pharmaceutical 

15 composition is prepared that includes a substance capable of modulating 
MS4A expression or function, and a carrier. An effective dose of the 
pharmaceutical composition is administered to a subject, whereby MS4A 
activity is altered in the subject. Provided are therapeutic methods wherein 
a change in MS4A activity comprises a shift in the abundance of cell 

20 subpopulations expressing said protein, modulation of [Ca 2+ ]i levels, or 
altered cell function. In a preferred embodiment, the substance used to 
perform this method shows specific binding to some or all amino acids of 
any one of the even-numbered SEQ ID NOs:2-38, and was discovered by a 
method of the present invention. In another embodiment, MS4A function is 

25 disrupted by immunizing a subject with an effective dose of the disclosed 
MS4A polypeptide. The immune system of the subject produces an 
antibody that specifically recognizes the MS4A polypeptide, and preferably 
recognizes some or all of amino acids of any one of the even-numbered 
SEQ ID NOs:2-38. In a further embodiment, a gene therapy vector is used, 

30 the vector comprising a nucleotide sequence encoding a MS4A polypeptide. 
Alternatively, the gene therapy vector comprises a nucleotide sequence 
encoding a nucleic acid molecule, a peptide, or a protein that interacts with a 
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MS4A nucleic acid or polypeptide. Preferably, the subject is a human 
subject. 

Accordingly, it is an object of the present invention to provide novel 
MS4A nucleic acid and polypeptide sequences, and novel methods relating 
5 thereto. This object is achieved in whole or in part by the present invention. 

An object of the invention having been stated above, other objects 
and advantages of the present invention will become apparent to those 
skilled in the art after a study of the following description of the invention, 
Figures and non-limiting Examples. 

10 

Brief Description of the Drawings 
Figure 1 depicts cDNAs encoded by fifteen new human or mouse 
MS4A gene products. Consensus sequences from cDNAs and overlapping 
ESTs are indicated by their GenBank Accession numbers. Representative 

1 5 full-length cDNAs for each gene product are shown, except for MS4a3 which 
was not full-length. 5* and 3' untranslated sequences are shown as 
horizontal lines with relative nucleotide lengths shown. Coding regions are 
shown as boxes with translation initiation and termination codons and their 
relative nucleotide locations shown. Poly(A) attachment signal sequences 

20 (AATAAA) are indicated when known. Deduced hydrophobic regions are 
shown as filled boxes with the predicted membrane-spanning domains 
shown as TM1-TM4. Additional hydrophobic regions in MS4A4 proteins are 
shown as shaded boxes. Sites of putative nucleotide polymorphisms in 
MS4A6A are indicated by two (X)s. 

25 Figure 2 depicts exon-intron organization of the human MS4A genes. 

The maps were constructed by aligning known and predicted MS4A cDNA 
sequences with human genomic sequences as described in Materials and 
Methods. Exons are shown as boxes with the predicted translation initiation 
codons (ATG), transmembrane domains (TM) and termination codons 

30 indicated on the top. All exon and intron distances are shown to scale. 
Gaps indicate where intron distances have not been determined for MS4A3, 
MS4A4A, and MS4A12. Two long introns present in MS4A4E are not to 
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scale but the intron lengths are indicated. Exon numbering for MS4A1, and 
MS4A2 is as published (Kuster et al., 1992; Tedder et al.. 1988a; Tedder et 
al., 1988b). 

Figure 3 shows human MS4A4E protein and transcript sequences 

5 predicted from genomic DNA sequences. MS4A4E sequences are 
compared with human MS4A4A cDNA (disclosed herein) and genomic 
sequences. Gaps were introduced to provide optimal alignment. The boxed 
AAC sequence near the 5' end of the MS4A4A sequence indicates the 
length of the most 5' MS4A4A cDNA sequence. Sequences upstream of this 

10 are based on contiguous genomic DNA sequences. Nucleotide numbering 
is based on the MS4A4A cDNA sequence, disclosed herein. Predicted 
translation initiation codons are shaded. Predicted membrane-spanning 
regions are underlined. An asterisk indicates predicted translation 
termination codons. Potential poly-A attachment signal sequences 

15 (AATAAA) are boxed. 

Figure 4 shows human MS4A6E protein and transcript sequences 
predicted from genomic DNA and overlapping cDNA sequences. 
PredictedMS4A6E transcript sequences are compared with human MS4A6A 
cDNA sequence (disclosed herein). Gaps were introduced in the nucleotide 

20 sequence to provide optimal alignment. The 5' end of both transcripts start 
at 3" splice-acceptor sites which demark the first translated exons for both 
genes. The 5' end of the putative MS4A6E transcript is based on genomic 
DNA sequence, while the predicted sequences starting at nucleotide 60 
were based on both genomic DNA sequences and overlapping cDNA 

25 sequences. A gap in the MS4A6A sequence is indicated where TM1/2 and 
TM2 exons are not found in MS4A6E transcripts. MS4A6A nucleotide 
numbering is based on the cDNA sequence (disclosed herein). Predicted 
translation initiation codons are shaded. Predicted membrane-spanning 
regions are underlined. An asterisk indicates the predicted translation 

30 termination codon for the MS4A6E protein. 

Figure 5 shows human MS4A10 protein and transcript sequences 
predicted from human genomic DNA sequences. MS4A10 nucleotide 
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sequence is compared with mouse MS4a10 cDNA sequence (disclosed 
herein). The 5 1 end of both transcripts start at 3' splice-acceptor sites which 
demark the first translated exons for both genes. MS4a 10 nucleotide 
numbering is based on the cDNA sequence (disclosed herein). Predicted 

5 . translation initiation codons are shaded. Predicted membrane-spanning 
regions are underlined. An asterisk indicates predicted translation 
termination codon for the MS4A10 protein. Potential poly-A attachment 
signal sequences (AATAAA) are boxed. 

Figure 6 depicts a physical linkage map for the MS4A genes. A 

10 scheme for chromosome 11 structure is shown on the left with the mapped 
locations for MS4A1, MS4A2 and MS4A3 indicated. Representative human 
BAC clones are shown as vertical black bars with clone names shown on the 
top and clone size shown at the bottom. All distances are shown to the 
indicated scale. The distance between and spatial relationship of RP11- 

15 312N17 to the four other overlapping BACs shown at the bottom are 
unknown. Thin bars indicate continuous characterized (mapped or 
sequenced) regions of DNA that contain identified MS4A genes. When the 
relative position of this region of DNA is known relative to the representative 
BACs that are shown, the thin bars overlay the BAC. The mapped position 

20 of each MS4A gene is indicated on the right with the relative direction of 
gene translation indicated by arrows (-*■). In some cases, approximate 
distances between MS4A genes (termination codons to the translation 
initiation codon for the next gene) are indicated in base pairs (bp). In some 
cases, approximate MS4A gene size is indicated showing the distance 

25 between predicted translation initiation codons and translation termination 
codons as show in Figure 7. 

Figure 7 depicts deduced amino acid sequences for CD20 (human 
A1, SEQ ID NO:40; mouse a1, SEQ ID NO:48), FceRIp (human A2, SEQ ID 
NO:42; mouse a2. SEQ ID NO:50), HTm4 (human A3, SEQ ID NO:44; 

30 mouse a3, SEQ ID NO:20), and 19 new MS4A (human) (even-numbered 
SEQ ID NOs:2-18, 46) and MS4a (mouse and pig) proteins (even-numbered 
SEQ ID NOs:22-38, 56). Gaps were introduced to optimize alignments. 
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Numbers represent predicted residue positions. The predicted membrane- 
spanning regions (TM1-TM4) are indicated. Predicted intron|exon splice 
junctions are indicated by vertical bars where information was available. 
Amino acids common to 10 or more proteins are shaded, "indicates partial 

5 sequence for the MS4a3 protein. CD20, FceRIB, and HTm4 sequences and 
known intron|exon borders (SEQ ID NOs:39-44, 47-50) are as published 
(Adra et al., 1994; Kiister et at., 1992; Ra et al., 1989; Tedder et al., 1988a; 
Tedder et al., 1989b; Tedder et al., 1988b). MS4A12 represents a 
conceptual translation (SEQ ID NO:46) of a human colon mucosa cDNA 

10 sequence (GenBank AK000224, SEQ ID NO:45), and MS4a12 represents a 
conceptual translation (SEQ ID NO:56) of a homologous cDNA sequence 
from pig (GenBank AJ236932, SEQ ID NO:55). 

Figure 8 depicts UPGMA (unweighted pair group method using 
arithmetic averages) tree of deduced MS4A and MS4a protein sequences. 

15 Horizontal tree branch length is a measure of sequence relatedness. For 
example, MS4a4B and MS4a4C are the most similar in sequence, while 
CD20 (MS4A1) sequences were the most divergent from other family 
members. The MS4a12p sequence was from pig, while all other MS4a 
sequences were from mouse. The UPGMA tree was generated using 

20 Geneworks version 2.0 (IntelliGenetics, Inc., Mountain View, California, 
USA). 

Figure 9 shows immunofluorescent detection of CD20 expression 
during B cell development. Single cell suspensions of leukocytes were 
isolated from wild-type mice, stained using MB20-13 (visualized using a PE- 

25 conjugated, anti-mouse lgG3 antibody) and anti-B220 (FITC-conjugated) 
monoclonal antibodies, and examined by two-color immunofluorescence 
staining with flow cytometry analysis. Quadrant gates indicate negative and 
positive populations of cells as determined using isotype-matched control 
monoclonal antibodies. The gated cell populations correspond to the cells 

30 described in Table 7, and are shown for reference. These results are 
representative of those obtained with six (6) two month-old wild type mice. 
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Figure 10 summarizes the strategy for targeted disruption of the 
mouse CD20 gene. 

Figure 10A shows genomic clones encoding CD20. 

Figure 10B shows the intron-exon organization of the wild typeCD20 
5 allele containing exons 5-8 (shaded squares). 

Figure 10C shows the structure of the CD20 targeting vector. 

Figure 10D shows the predicted structure of the CD20 allele after 
gene targeting in ES cells by homologous recombination. The EcoR V 
restriction site in exon 6 is deleted as indicated. 
10 Figure 10E presents Southern blot analysis of tail DNA from two wild 

type and four CD20"'' mice. Genomic DNA was digested with EcoR V, 
transferred to nitrocellulose and hybridized with the 5' probe indicated in (D). 

Figure 10F shows PCR amplification of genomic DNA from wild type 
and CD20" 7 ' mice using primers that bind in exons 6 and 7. Amplification of 
15 glyceraldehyde-3-phosphate dehydrogenase (G3PDH) is shown as a 
positive control. 

Figure 10G shows PCR amplification of cDNA generated from splenic 
RNA of wild type and CD20' / " mice. Each reaction mixture contained a 
sense primer that hybridized with sequences encoded by exon 3 and 
20 antisense primers that hybridized with either exon 6 or Ned gene promoter 
sequences. 

Figures 10H and 101 show reactivity of the MB20-13 monoclonal 
antibody with CD20 cDNA-transfected (thick line) or untransfected (dashed 
line) 300.19 cells (Figure 10H) or Chinese Hamster Ovary (CHO) cells 
25 (Figure 101). The thin lines represent CD20 cDNA-transfected cells stained 
with secondary antibody alone or an isotype-control monoclonal antibody. 
Indirect immunofluorescence staining was visualized by flow cytometry 
analysis. 

Figure 10J shows immunofluorescent staining of splenocytes from 
30 CD20"'" or wild type mice with MB20-13 (visualized using a PE-conjugated, 
anti-mouse lgG3 antibody) and anti-B220 (FITC-conjugated) monoclonal 
antibodies. Splenocytes from CD20" 7 " mice generated histograms identical to 
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those obtained without MB20-1 monoclonal antibody present, using the 
secondary antibody alone. 

Figure 11 depicts immunofluorescent detection of B lymphocyte 
subpopulations in CD20*'" and wild type mice. Lymphocytes were isolated 
5 and examined by two color immunofluorescent staining with flow cytometry 
analysis. Quadrants delineated by squares indicate negative and positive 
populations of cells as determined using unreactive monoclonal antibody 
controls. The gated cell populations correspond to the cells described in 
Table 7 that represent at least 6 mice of each genotype. 

10 Figure 12 shows altered signal transduction in CD20" y " B cells. Figure 

12 also shows CD19 expression by splenocytes from CD20~ / ~ (thin line) and 
wild type (thick line) mice. Immunofluorescence staining using PE- 
conjugated anti-CD19 monoclonal antibody with flow cytometry analysis. 
The dashed line represents staining of wild type splenocytes with a control 

1 5 antibody. 

Figure 12A presents calcium responses induced by BCR or CD19 
ligation in CD20" 7 " and wild type B cells. Splenocytes were loaded with 1 pM 
indo-1-AM ester and B cells were stained with FITC-conjugated anti-B220 
antibody. At 1 min (arrow), optimal concentrations of goat anti-IgM F(ab , ) 2 

20 antibody fragments, anti-CD1 9 monoclonal antibody or Thapsigargin were 
added, with or without EGTA present. Increased ratios of indo-1 
fluorescence indicate increased [Ca 2+ ]i. Results represent those from at 
least four experiments. 

Figure 12B presents assays of tyrosine phosphorylation of proteins 

25 from purified splenic B cells of CD20" / " and wild type mice. B cells (2 x 
10 7 /sample) were incubated with anti-IgM antibody for the times shown and 
detergent lysed. Proteins were resolved by SDS-PAGE, transferred to 
nitrocellulose and immunoblotted with anti-phosphotyrosine (anti-PTyr) 
antibody. The blot was stripped and reprobed with anti-SHP-1 antibody as a 

30 control for equivalent protein loading. Western blots from two of three 
experiments are shown to demonstrate the range of results. 
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Detailed Description of the Invention 
The present invention provides isolated nucleic acids encoding MS4A 
polypeptides (representative embodiments set forth as the odd-numbered 
SEQ ID NOs:1-37), isolated MS4A polypeptides (representative 
5 embodiments set forth as the even-numbered SEQ ID NOs:2-38), and uses 
thereof. The disclosed MS4A nucleic acids and polypeptides can be used 
according to methods of the present invention for drug discovery screens, for 
therapeutic treatment of atopic conditions, and for therapeutic regulation of 
[Ca 2+ ]i levels, among other uses. 

10 

I DEFINITIONS 

While the following terms are believed to be well understood by one 
of ordinary skill in the art, the following definitions are set forth to facilitate 
explanation of the invention. The entire contents of all publications 
15 mentioned herein, including the discussion of the background art presented 
above, are hereby fully incorporated by reference. 
I. A. MS4A nucleic acids 

The nucleic acid molecules provided by the present invention include 
the isolated nucleic acid molecules of any one of the odd-numbered SEQ ID 

20 NOs:1-37, sequences substantially similar to sequences of any one of the 
odd-numbered SEQ ID NOs:1-37, conservative variants thereof, 
subsequences and elongated sequences thereof, complementary DNA 
molecules, and corresponding RNA molecules. The present invention also 
encompasses genes, cDNAs, chimeric genes, and vectors comprising 

25 disclosed MS4A nucleic acid sequences. 

The term "nucleic acid molecule" refers to deoxyribonucleotides or 
ribonucleotides and polymers thereof in either single- or double-stranded 
form. Unless specifically limited, the term encompasses nucleic acids 
containing known analogues of natural nucleotides which have similar 

30 properties as the reference natural nucleic acid. Unless otherwise indicated, 
a particular nucleotide sequence also implicitly encompasses conservatively 
modified variants thereof (e.g. degenerate codon substitutions), 
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complementary sequences, subsequences, elongated sequences, as well as 
the sequence explicitly indicated. The terms "nucleic acid molecule" or 
"nucleotide sequence" can also be used in place of "gene", "cDNA", or 
"mRNA". Nucleic acids can be derived from any source, including any 
5 organism. 

The term "isolated", as used in the context of a nucleic acid molecule, 
indicates that the nucleic acid molecule exists apart from its native 
environment and is not a product of nature. An isolated DNA molecule can 
exist in a purified form or can exist in a non-native environment such as a 

1 0 transgenic host cell. 

The term "purified", when applied to a nucleic acid, denotes that the 
nucleic acid is essentially free of other cellular components with which it is 
associated in the natural state. Preferably, a purified nucleic acid molecule 
is a homogeneous dry or aqueous solution. The term "purified" denotes that 

15 a nucleic acid or protein gives rise to essentially one band in an 
electrophoretic gel. Particularly, it means that the nucleic acid is at least 
about 50% pure, more preferably at least about 85% pure, and most 
preferably at least about 99% pure. 

The term "substantially identical", the context of two nucleotide or 

20 amino acid sequences, can also be defined as two or more sequences or 
subsequences that have at least 60%, preferably 80%, more preferably 90- 
95%, and most preferably at least 99% nucleotide or amino acid sequence 
identity, when compared and aligned for maximum correspondence, as 
measured using one of the following sequence comparison algorithms 

25 (described herein below under the heading Nucleotide and Amino Acid 
Sequence Comparisons ) or by visual inspection. Preferably, the substantial 
identity exists in nucleotide sequences of at least 50 residues, more 
preferably in nucleotide sequence of at least about 100 residues, more 
preferably in nucleotide sequences of at least about 150 residues, and most 

30 preferably in nucleotide sequences comprising complete coding sequences. 
In one aspect, polymorphic sequences can be substantially identical 
sequences. The term "polymorphic" refers to the occurrence of two or more 
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genetically determined alternative sequences or alleles in a population. An 
allelic difference can be as small as one base pair. 

Another indication that two nucleotide sequences are substantially 
identical is that the two molecules specifically or substantially hybridize to 
5 each other under stringent conditions. In the context of nucleic acid 
hybridization, two nucleic acid sequences being compared can be 
designated a "probe" and a "target". A "probe" is a reference nucleic acid 
molecule, and a '"target" is a test nucleic acid molecule, often found within a 
heterogenous population of nucleic acid molecules. A 'target sequence" is 
1 0 synonymous with a "test sequence". 

A preferred nucleotide sequence employed for hybridization studies 
or assays includes probe sequences that are complementary to or mimic at 
least an about 14 to 40 nucleotide sequence of a nucleic acid molecule of 
the present invention. Preferably, probes comprise 14 to 20 nucleotides, or 
15 even longer where desired, such as 30, 40, 50, 60, 100, 200, 300, or 500 
nucleotides or up to the full length of any of those set forth as the odd- 
numbered SEQ ID NOs:1-37. Such fragments can be readily prepared by, 
for example, directly synthesizing the fragment by chemical synthesis, by 
application of nucleic acid amplification technology, or by introducing 
20 selected sequences into recombinant vectors for recombinant production. 

The phrase "hybridizing specifically to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide 
sequence under stringent conditions when that sequence is present in a 
complex nucleic acid mixture (e.g., total cellular DNA or RNA). The phrase 
25 "binds substantially to" refers to complementary hybridization between a 
probe nucleic acid molecule and a target nucleic acid molecule and 
embraces minor mismatches that can be accommodated by reducing the 
stringency of the hybridization media to achieve the desired hybridization. 

"Stringent hybridization conditions" and "stringent hybridization wash 
30 conditions" in the context of nucleic acid hybridization experiments such as 
Southern and Northern blot analysis are both sequence- and environment- 
dependent. Longer sequences hybridize specifically at higher temperatures. 
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An extensive guide to the hybridization of nucleic acids is found in Tijssen 
(1993) Laboratory Techniques in Biochemistry and Molecular Bioloqy- 
Hvbridization with Nucleic Acid Probes , part I chapter 2, Elsevier, New York, 
New York. Generally, highly stringent hybridization and wash conditions are 

5 selected to be about 5°C lower than the thermal melting point (T m ) for the 
specific sequence at a defined ionic strength and pH. Typically, under 
"stringent conditions" a probe will hybridize specifically to its target 
subsequence, but to no other sequences. 

The T m is the temperature (under defined ionic strength and pH) at 

10 which 50% of the target sequence hybridizes to a perfectly matched probe. 
Very stringent conditions are selected to be equal to the T m for a particular 
probe. An example of stringent hybridization conditions for Southern or 
Northern Blot analysis of complementary nucleic acids having more than 
about 100 complementary residues is overnight hybridization in 50% 

15 formamide with 1 mg of heparin at 42°C. An example of highly stringent 
wash conditions is 15 minutes in 0.1 5 M NaCI at 65°C. An example of 
stringent wash conditions is 15 minutes in 0.2X SSC buffer at 65°C (See 
Sambrook et al. eds. (1989) Molecular Cloning: A Laboratory Manual. Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York for a 

20 description of SSC buffer). Often, a high stringency wash is preceded by a 
low stringency wash to remove background probe signal. An example of 
medium stringency wash conditions for a duplex of more than about 100 
nucleotides, is 15 minutes in 1X SSC at 45°C. An example of low stringency 
wash for a duplex of more than about 100 nucleotides, is 15 minutes in 4-6X 

25 SSC at 40°C. For short probes (e.g., about 10 to 50 nucleotides), stringent 
conditions typically involve salt concentrations of less than about 1 .0 M Na + 
ion, typically about 0.01 to 1.0 M Na + ion concentration (or other salts) at pH 
7.0-8.3, and the temperature is typically at least about 30°C. Stringent 
conditions can also be achieved with the addition of destabilizing agents 

30 such as formamide. In general, a signal to noise ratio of 2-fold (or higher) 
than that observed for an unrelated probe in the particular hybridization 
assay indicates detection of a specific hybridization. 
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The following are examples of hybridization and wash conditions that 
can be used to clone homologous nucleotide sequences that are 
substantially identical to reference nucleotide sequences of the present 
invention: a probe nucleotide sequence preferably hybridizes to a target 
5 nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4) 1 
mM EDTA at 50°C followed by washing in 2X SSC, 0.1% SDS at 50°C; more 
preferably, a probe and target sequence hybridize in 7% sodium dodecyl 
sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C followed by washing in 1X 
SSC, 0.1% SDS at 50°C; more preferably, a probe and target sequence 
10 hybridize in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 
50°C followed by washing in 0.5X SSC, 0.1% SDS at 50°C; more preferably, 
a probe and target sequence hybridize in 7% sodium dodecyl sulfate (SDS), 
0.5 M NaP0 4 , 1 mM EDTA at 50°C followed by washing in 0.1X SSC, 0.1% 
SDS at 50°C; more preferably, a probe and target sequence hybridize in 7% 
15 sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C followed 
by washing in 0.1X SSC. 0.1% SDS at 65°C. 

A further indication that two nucleic acid sequences are substantially 
identical is that proteins encoded by the nucleic acids are substantially 
identical, share an overall three-dimensional structure, are biologically 
20 functional equivalents, or are immunologically cross-reactive. These terms 
are defined further under the heading MS4A Polypeptides herein below. 
Nucleic acid molecules that do not hybridize to each other under stringent 
conditions are still substantially identical if the corresponding proteins are 
substantially identical. This can occur, for example, when two nucleotide 
25 sequences are significantly degenerate as permitted by the genetic code. 

The term "conservatively substituted variants" refers to nucleic acid 
sequences having degenerate codon substitutions wherein the third position 
of one or more selected (or all) codons is substituted with mixed-base and/or 
deoxyinosine residues (Batzer et al. (1991) Nucleic Acids Res 19:5081; 
30 Ohtsuka et al. (1985) J Biol Chem 260:2605-2608; Rossolini et al. (1994) 
Mol Cell Probes 8:91-98). 
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The term "subsequence" refers to a sequence of nucleic acids that 
comprises a part of a longer nucleic acid sequence. An exemplary 
subsequence is a probe, described herein above, or a primer. The term 
"primer as used herein refers to a contiguous sequence comprising about 8 
5 or more deoxyribonucleotides or ribonucleotides, preferably 10-20 
nucleotides, and more preferably 20-30 nucleotides of a selected nucleic 
acid molecule. The primers of the invention encompass oligonucleotides of 
sufficient length and appropriate sequence so as to provide initiation of 
polymerization on a nucleic acid molecule of the present invention. 
10 The term "elongated sequence" refers to an addition of nucleotides 

(or other analogous molecules) incorporated into the nucleic acid. For 
example, a polymerase (e.g., a DNA polymerase), e.g., a polymerase which 
adds sequences at the 3' terminus of the nucleic acid molecule. In addition, 
the nucleotide sequence can be combined with other DNA sequences, such 
15 as promoters, promoter regions, enhancers, polyadenylation signals, intronic 
sequences, additional restriction enzyme sites, multiple cloning sites, and 
other coding segments. 

The term "complementary sequence", as used herein, indicates two 
nucleotide sequences that comprise antiparallel nucleotide sequences 
20 capable of pairing with one another upon formation of hydrogen bonds 
between base pairs. As used herein, the term "complementary sequences" 
means nucleotide sequences which are substantially complementary, as can 
be assessed by the same nucleotide comparison set forth above, or is 
defined as being capable of hybridizing to the nucleic acid segment in 
25 question under relatively stringent conditions such as those described 
herein. A particular example of a complementary nucleic acid segment is an 
antisense oligonucleotide. 

The term "gene" refers broadly to any segment of DNA associated 
with a biological function. A gene encompasses sequences including but not 
30 limited to a coding sequence, a promoter region, a cis-regulatory sequence, 
a non-expressed DNA segment is a specific recognition sequence for 
regulatory proteins, a non-expressed DNA segment that contributes to gene 
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expression, a DNA segment designed to have desired parameters, or 
combinations thereof. A gene can be obtained by a variety of methods, 
including cloning from a biological sample, synthesis based on known or 
predicted sequence information, and recombinant derivation of an existing 
5 sequence. 

The term "gene expression" generally refers to the cellular processes 
by which a biologically active polypeptide is produced from a DNA 
sequence. 

The present invention also encompasses chimeric genes comprising 

10 the disclosed MS4A sequences. The term "chimeric gene", as used herein, 
refers to a promoter region operably linked to a MS4A coding sequence, a 
nucleotide sequence producing an antisense RNA molecule, a RNA 
molecule having tertiary structure, such as a hairpin structure, or a double- 
stranded RNA molecule. 

15 The term "operably linked", as used herein, refers to a promoter 

region that is connected to a nucleotide sequence in such a way that the 
transcription of that nucleotide sequence is controlled and regulated by that 
promoter region. Techniques for operatively linking a promoter region to a 
nucleotide sequence are well known in the art. 

20 The terms "heterologous gene", "heterologous DNA sequence", 

"heterologous nucleotide sequence", "exogenous nucleic acid molecule", or 
"exogenous DNA segment", as used herein, each refer to a sequence that 
originates from a source foreign to an intended host cell or, if from the same 
source, is modified from its original form. Thus, a heterologous gene in a 

25 host cell includes a gene that is endogenous to the particular host cell but 
has been modified, for example by mutagenesis or by isolation from native 
cis-regulatory sequences. The terms also include non-naturally occurring 
multiple copies of a naturally occurring nucleotide sequence. Thus, the 
terms refer to a DNA segment that is foreign or heterologous to the cell, or 

30 homologous to the cell but in a position within the host cell nucleic acid 
wherein the element is not ordinarily found. 

The term "promoter region" defines a nucleotide sequence within a 
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gene that is positioned 5' to a coding sequence of a same gene and 
functions to direct transcription of the coding sequence. The promoter 
region includes a transcriptional start site and at least one cis-regulatory 
element. The present invention encompasses nucleic acid sequences that 

5 comprise a promoter region of a MS4A gene, or functional portion thereof. 

The term "cis-acting regulatory sequence" or "cis-regulatory motif or 
"response element", as used herein, each refer to a nucleotide sequence 
that enables responsiveness to a regulatory transcription factor. 
Responsiveness can encompass a decrease or an increase in 

1 0 transcriptional output and is mediated by binding of the transcription factor to 
the DNA molecule comprising the response element. 

The term "transcription factor" generally refers to a protein that 
modulates gene expression by interaction with the cis-regulatory element 
and cellular components for transcription, including RNA Polymerase, 

15 • Transcription Associated Factors (TAFs), chromatin-remodeling proteins, 
and any other relevant protein that impacts gene transcription. 

A "functional portion" of a promoter gene fragment is a nucleotide 
sequence within a promoter region that is required for normal gene 
transcription. To determine nucleotide sequences that are functional, the 

20 expression of a reporter gene is assayed when variably placed under the 
direction of a promoter region fragment. 

Promoter region fragments can be conveniently made by enzymatic 
digestion of a larger fragment using restriction endonucleases or DNAse I. 
Preferably, a functional promoter region fragment comprises about 5000 

25 nucleotides, more preferably 2000 nucleotides, more preferably about 1000 
nucleotides. Even more preferably a functional promoter region fragment 
comprises about 500 nucleotides, even more preferably a functional 
promoter region fragment comprises about 100 nucleotides, and even more 
preferably a functional promoter region fragment comprises about 20 

30 nucleotides. 

The terms "reporter gene" or "marker gene" or "selectable marker" 
each refer to a heterologous gene encoding a product that is readily 
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observed and/or quantitated. A reporter gene is heterologous in that it 
originates from a source foreign to an intended host cell or, if from the same 
source, is modified from its original form. Non-limiting examples of 
detectable reporter genes that can be operably linked to a transcriptional 

5 regulatory region can be found in Alam & Cook (1990) Anal Biochem 
188:245-254 and PCT International Publication No. WO 97/47763. 
Preferred reporter genes for transcriptional analyses include the lacZ gene 
(See , e.g., Rose & Botstein (1983) Meth Enzymol 101:167-180), Green 
Fluorescent Protein (GFP) (Cubitt et al. (1995) Trends Biochem Sci 20:448- 

10 455), luciferase, or chloramphenicol acetyl transferase (CAT). Preferred 
reporter genes for methods to produce transgenic animals include but are 
not limited to antibiotic resistance genes, and more preferably the antibiotic 
resistance gene confers neomycin resistance. Any suitable reporter and 
detection method can be used, and it will be appreciated by one of skill in 

15 the art that no particular choice is essential to or a limitation of the present 
invention. 

An amount of reporter gene can be assayed by any method for 
qualitatively or preferably, quantitatively determining presence or activity of 
the reporter gene product. The amount of reporter gene expression directed 

20 by each test promoter region fragment is compared to an amount of reporter 
gene expression to a control construct comprising the reporter gene in the 
absence of a promoter region fragment. A promoter region fragment is 
identified as having promoter activity when there is significant increase in an 
amount of reporter gene expression in a test construct as compared to a 

25 control construct. The term "significant increase", as used herein, refers to 
an quantified change in a measurable quality that is larger than the margin of 
error inherent in the measurement technique, preferably an increase by 
about 2-fold or greater relative to a control measurement, more preferably an 
increase by about 5-fold or greater, and most preferably an increase by 

30 about 10-fold or greater. 

The present invention further includes vectors comprising the 
disclosed MS4A sequences, including plasmids, cosmids, and viral vectors. 
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The term "vector", as used herein refers to a DNA molecule having 
sequences that enable its replication in a compatible host cell. A vector also 
includes nucleotide sequences to permit ligation of nucleotide sequences 
within the vector, wherein such nucleotide sequences are also replicated in a 
5 compatible host cell. A vector can also mediate recombinant production of a 
MS4A polypeptide, as described further herein below. Preferred vectors 
include but are not limited to pBluescript (Stratagene), pUC18, pBLCAT3 
(Luckow & Schutz (1987) Nucleic Acids Res 15:5490), pLNTK (Gorman et 
al. (1996) Immunity 5:241-252), and pBAD/glll (Stratagene). A preferred 
10 host cell is a mammalian cell; more preferably the cell is a Chinese hamster 
ovary cell, a HeLa cell, a baby hamster kidney cell, or a mouse cell; even 
more preferably the cell is a human cell. 

Nucleic acids of the present invention can be cloned, synthesized, 
recombinantly altered, mutagenized, or combinations thereof. Standard 
15 recombinant DNA and molecular cloning techniques used to isolate nucleic 
acids are well known in the art. Exemplary, non-limiting methods are 
described by Sambrook et al., eds. (1989); by Silhavy et al. (1984) 
Experiments with Gene Fusions , Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York; by Ausubel et al. (1992) Current Protocols in 
20 Molecular Biology . John Wylie and Sons, Inc., New York, New York; and by 
Glover, ed. (1985) DNA Cloning: A Practical Approach . MRL Press, Ltd., 
Oxford, United Kingdom. Site-specific mutagenesis to create base pair 
changes, deletions, or small insertions are also well known in the art as 
exemplified by publications, see, e.g., Adelman et al., (1983) DNA 2:183; 
25 Sambrook etal. (1989). 

Sequences detected by methods of the invention can be detected, 
subcloned, sequenced, and further evaluated by any measure well known in 
the art using any method usually applied to the detection of a specific DNA 
sequence including but not limited to dideoxy sequencing. PCR, oligomer 
30 restriction (Saiki et al. (1985) Bio/Technology 3:1008-1012), allele-specific 
oligonucleotide (ASO) probe analysis (Conner et al. (1983) Proc Natl Acad 
Sci USA 80:278), and oligonucleotide ligation assays (OLAs) (Landgren et. 
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al. (1988) Science 241:1007). Molecular techniques for DNA analysis have 
been reviewed (Landgren et. al. (1988) Science 242:229-237). 
I.B. MS4A Polypeptides 

The polypeptides provided by the present invention include the 
5 isolated polypeptides set forth as the even-numbered SEQ ID NOs:2-38, 
polypeptides substantially similar to the even-numbered SEQ ID NOs:2-38, 
MS4A polypeptide fragments, fusion proteins comprising MS4A amino acid 
sequences, biologically functional analogs, and polypeptides that cross-react 
with an antibody that specifically recognizes a MS4A polypeptide. 
10 The term "isolated ", as used in the context of a polypeptide, indicates 

that the polypeptide exists apart from its native environment and is not a 
product of nature. An isolated polypeptide can exist in a purified form or can 
exist in a non-native environment such as, for example, in a transgenic host 
cell. 

15 The term "purified", when applied to a polypeptide, denotes that the 

polypeptide is essentially free of other cellular components with which it is 
associated in the natural state. Preferably, a polypeptide is a homogeneous 
solid or aqueous solution. Purity and homogeneity are typically determined 
using analytical chemistry techniques such as polyacrylamide gel 

20 electrophoresis or high performance liquid chromatography. A polypeptide 
which is the predominant species present in a preparation is substantially 
purified. The term "purified" denotes that a polypeptide gives rise to 
essentially one band in an electrophoretic gel. Particularly, it means that the 
polypeptide is at least about 50% pure, more preferably at least about 85% 

25 pure, and most preferably at least about 99% pure. 

The term "substantially identical" in the context of two or more 
polypeptides sequences is measured by (a) polypeptide sequences having 
about 35%, or 45%, or preferably from 45-55%, or more preferably 55-65%, 
or most preferably 65% or greater amino acids which are identical or 

30 functionally equivalent. Percent "identity" and methods for determining 
identity are defined herein below under the heading Nucleotide and Amino 
Acid Sequence Comparisons . 
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Substantially identical polypeptides also encompass two or more 
polypeptides sharing a conserved three-dimensional structure. 
Computational methods can be used to compare structural representations, 
and structural models can be generated and easily tuned to identify 

5 similarities around important active sites or ligand binding sites. See 
Henikoff et al. (2000) Electrophoresis 21 (9):1 700-1 706; Huang et al. (2000) 
Pac Symp Biocomput 230-241; Saqi et al. (1999) Bioinformatics 15(6):521- 
522; and Barton (1 998) Acta Crystallogr D Biol Crystallogr 54: 1 1 39-1 1 46. 

The term "functionally equivalent" in the context of amino acid 

10 sequences is well known in the art and is based on the relative similarity of 
the amino acid side-chain substituents. See Henikoff & Henikoff (2000) Adv 
Protein Chem 54:73-97. Relevant factors for consideration include side- 
chain hydrophobicity, hydrophilicity, charge, and size. For example, 
arginine, lysine, and histidine are all positively charged residues; that 

15 alanine, glycine, and serine are all of similar size; and that phenylalanine, 
tryptophan, and tyrosine all have a generally similar shape. By this analysis, 
described further herein below, arginine, lysine, and histidine; alanine, 
glycine, and serine; and phenylalanine, tryptophan, and tyrosine; are defined 
herein as biologically functional equivalents. 

20 In making biologically functional equivalent amino acid substitutions, 

the hydropathic index of amino acids can be considered. Each amino acid 
has been assigned a hydropathic index on the basis of their hydrophobicity 
and charge characteristics, these are: isoleucine (+ 4.5); valine (+ 4.2); 
leucine (+ 3.8); phenylalanine (+ 2.8); cysteine (+ 2.5); methionine (+ 1.9); 

25 alanine (+ 1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (- 
0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and 
arginine (-4.5). 

The importance of the hydropathic amino acid index in conferring 
30 interactive biological function on a protein is generally understood in the art 
(Kyte et al. (1982) J Mol Biol 157:105.). It is known that certain amino acids 
can be substituted for other amino acids having a similar hydropathic index 
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or score and still retain a similar biological activity. In making changes based 
upon the hydropathic index, the substitution of amino acids whose 
hydropathic indices are within ±2 of the original value is preferred, those 
which are within ±1 of the original value are particularly preferred, and those 

5 within ±0.5 of the original value are even more particularly preferred. 

It is also understood in the art that the substitution of like amino acids 
can be made effectively on the basis of hydrophilicity. U.S. Patent No. 
4,554,101 states that the greatest local average hydrophilicity of a protein, 
as governed by the hydrophilicity of its adjacent amino acids, correlates with 

10 its immunogenicity and antigenicity, i.e. with a biological property of the 
protein. It is understood that an amino acid can be substituted for another 
having a similar hydrophilicity value and still obtain a biologically equivalent 
protein. 

As detailed in U.S. Patent No. 4,554,101, the following hydrophilicity 
15 values have been assigned to amino acid residues: arginine (+ 3.0); lysine 
(+ 3.0); aspartate (+ 3.0+1); glutamate (+ 3.0±1); serine (+ 0.3); asparagine 
(+ 0.2); glutamine (+ 0.2); glycine (0); threonine (-0.4); proline (-0.5±1); 
alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); 
leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); 
20 tryptophan (-3.4) . 

In making changes based upon similar hydrophilicity values, the 
substitution of amino acids whose hydrophilicity values are within ±2 of the 
original value is preferred, those which are within ±1 of the original value are 
particularly preferred, and those within ±0.5 of the original value are even 
25 more particularly preferred. 

The present invention also encompasses MS4A polypeptide 
fragments or functional portions of a MS4A polypeptide. Such functional 
portion need not comprise all or substantially all of the amino acid sequence 
of a native MS4A gene product. The term "functional" includes any 
30 biological activity or feature of MS4A, including immunogenicity. 

The present invention also includes longer sequences of a MS4A 
polypeptide, or portion thereof. For example, one or more amino acids can 
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be added to the N-terminus or C-terminus of a MS4A polypeptide. Fusion 
proteins comprising MS4A polypeptide sequences are also provided within 
the scope of the present invention. Methods of preparing such proteins are 
known in the art. 

5 The present invention also encompasses functional analogs of a 

MS4A polypeptide. Functional analogs share at least one biological function 
with a MS4A polypeptide. An exemplary function is immunogenicity. In the 
context of amino acid sequence, biologically functional analogs, as used 
herein, are peptides in which certain, but not most or all, of the amino acids 

10 can be substituted. Functional analogs can be created at the level of the 
corresponding nucleic acid molecule, altering such sequence to encode 
desired amino acid changes. In one embodiment, changes can be 
introduced to improve the antigenicity of the protein. In another 
embodiment, a MS4A polypeptide sequence is varied so as to assess the 

1 5 activity of a mutant MS4A polypeptide. 

The present invention also encompasses recombinant production of 
the disclosed MS4A polypeptides. Briefly, a nucleic acid sequence encoding 
a MS4A polypeptide, or portion thereof, is cloned into a expression cassette, 
the cassette is introduced into a host organism, where it is recombinantly 

20 produced. 

The term "expression cassette" as used herein means a DNA 
sequence capable of directing expression of a particular nucleotide 
sequence in an appropriate host cell, comprising a promoter operably linked 
to the nucleotide sequence of interest which is operably linked to termination 

25 signals. It also typically comprises sequences required for proper translation 
of the nucleotide sequence. The expression cassette comprising the 
nucleotide sequence of interest can be chimeric. The expression cassette 
can also be one which is naturally occurring but has been obtained in a 
recombinant form useful for heterologous expression. 

30 The expression of the nucleotide sequence in the expression cassette 

can be under the control of a constitutive promoter or an inducible promoter 
which initiates transcription only when the host cell is exposed to some 
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particular external stimulus. Exemplary promoters include Simian virus 40 
early promoter, a long terminal repeat promoter from retrovirus, an action 
promoter, a heat shock promoter, and a metallothien protein. In the case of 
a multicellular organism, the promoter and promoter region can direct 

5 expression to a particular tissue or organ or stage of development. 
Exemplary tissue-specific promoter regions include a MS4A promoter, 
described herein. Suitable expression vectors which can be used include, 
but are not limited to, the following vectors or their derivatives: human or 
animal viruses such as vaccinia virus or adenovirus, yeast vectors, 

10 bacteriophage vectors (e.g., lambda phage), and plasmid and cosmids DNA 
vectors. 

The term "host cell", as used herein, refers to a cell into which a 
heterologous nucleic acid molecule has been introduced. Transformed cells, 
tissues, or organisms are understood to encompass not only the end product 
15 of a transformation process, but also transgenic progeny thereof. 

A host cell strain can be chosen which modulates the expression of 
the inserted sequences, or modifies and processes the gene product in the 
specific fashion desired. For example, different host cells have 
characteristic and specific mechanisms for the translational and post- 
20 transactional processing and modification (e.g., glycosylation, 
phosphorylation of proteins). Appropriate cell lines or host systems can be 
chosen to ensure the desired modification and processing of the foreign 
protein expressed. Expression in a bacterial system can be used to produce 
a non-glycosylated core protein product. Expression in yeast will produce a 
25 glycosylated product. Expression in animal cells can be used to ensure 
"native" glycosylation of a heterologous protein. 

Expression constructs are transfected into a host cell by any standard 
method, including electroporation, calcium phosphate precipitation, DEAE- 
Dextran transfection, liposome-mediated transfection, and infection using a 
30 retrovirus. The MS4A-encoding nucleotide sequence carried in the 
expression construct can be stably integrated into the genome of the host or 
it can be present as an extrachromosomal molecule. 
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Isolated polypeptides and recombinantly produced polypeptides can 
be purified and characterized using a variety of standard techniques that are 
well known to the skilled artisan. See, e.g. Ausubel et al. (1992), 
Bodanszky, et al. (1976) Peptide Synthesis . John Wiley and Sons, Second 
5 Edition, New York, New York and Zimmer et al. (1993) Peptides, pp. 393- 
394, ESCOM Science Publishers, B. V. 

I.C. Nucleotide and Amino Acid Sequence Comparisons 
The terms "identical" or percent "identity" in the context of two or more 
nucleotide or polypeptide sequences, refer to two or more sequences or 
10 subsequences that are the same or have a specified percentage of amino 
acid residues or nucleotides that are the same, when compared and aligned 
for maximum correspondence, as measured using one of the sequence 
comparison algorithms disclosed herein or by visual inspection.. 

The term "substantially identical" in regards to a nucleotide or 
15 polypeptide sequence means that a particular sequence varies from the 
sequence of a naturally occurring sequence by one or more deletions, 
substitutions, or additions, the net effect of which is to retain at least some of 
biological activity of the natural gene, gene product, or sequence. Such 
sequences include "mutant" sequences, or sequences wherein the biological 
20 activity is altered to some degree but retains at least some of the original 
biological activity. The term "naturally occurring", as used herein, is used to 
describe a composition that can be found in nature as distinct from being 
artificially produced by man. For example, a protein or nucleotide sequence 
present in an organism, which can be isolated from a source in nature and 
25 which has not been intentionally modified by man in the laboratory, is 
naturally occurring. 

For sequence comparison, typically one sequence acts as a 
reference sequence to which test sequences are compared. When using a 
sequence comparison algorithm, test and reference sequences are entered 
30 into a computer program, subsequence coordinates are designated if 
necessary, and sequence algorithm program parameters are selected. The 
sequence comparison algorithm then calculates the percent sequence 
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identity for the designated test sequence(s) relative to the reference 
sequence, based on the selected program parameters. 

Optimal alignment of sequences for comparison can be conducted, 
e.g., by the local homology algorithm of Smith & Waterman (1981) AdvAppI 

5 Math 2:482, by the homology alignment algorithm of Needleman & Wunsch 
(1970) J Mol Biol 48:443, by the search for similarity method of Pearson & 
Lipman (1988) Proc Natl Acad Sci USA 85:2444-2448, by computerized 
implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA 
in the Wisconsin Genetics Software Package, Genetics Computer Group, 

10 Madison, Wl), or by visual inspection. See generally, Ausubel et al., 1992. 

A preferred algorithm for determining percent sequence identity and 
sequence similarity is the BLAST algorithm, which is described in Altschul et 
al. (1990) J Mol Biol 21 5: 403-410. Software for performing BLAST analyses 
is publicly available through the National Center for Biotechnology 

15 Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first 
identifying high scoring sequence pairs (HSPs) by identifying short words of 
length W in the query sequence, which either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a 
database sequence. T is referred to as the neighborhood word score 

20 threshold. These initial neighborhood word hits act as seeds for initiating 
searches to find longer HSPs containing them. The word hits are then 
extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are 
calculated using, for nucleotide sequences, the parameters M (reward score 

25 for a pair of matching residues; always > 0) and N (penalty score for 
mismatching residues; always < 0). For amino acid sequences, a scoring 
matrix is used to calculate the cumulative score. Extension of the word hits 
in each direction are halted when the cumulative alignment score falls off by 
the quantity X from its maximum achieved value, the cumulative score goes 

30 to zero or below due to the accumulation of one or more negative-scoring 
residue alignments, or the end of either sequence is reached. The BLAST 
algorithm parameters W, T, and X determine the sensitivity and speed of the 
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alignment. The BLASTN program (for nucleotide sequences) uses as 
defaults a wordlength W=1 1 , an expectation E=10, a cutoff of 100, M=5, N=- 
4, and a comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) 

5 of 10, and the BLOSUM62 scoring matrix. See Henikoff & Henikoff (1989) 
Proc Natl Acad Sci USA 89:10915. 

In addition to calculating percent sequence identity, the BLAST 
algorithm also performs a statistical analysis of the similarity between two 
sequences. See, e.g., Karlin and Altschul (1993) Proc Natl Acad Sci USA 

10 90:5873-5887. One measure of similarity provided by the BLAST algorithm 
is the smallest sum probability (P(N)), which provides an indication of the 
probability by which a match between two nucleotide or amino acid 
sequences would occur by chance. For example, a test nucleic acid 
sequence is considered similar to a reference sequence if the smallest sum 

1 5 probability in a comparison of the test nucleic acid sequence to the reference 
nucleic acid sequence is less than about 0.1, more preferably less than 
about 0.01 , and most preferably less than about 0.001 . 
LP. Antibodies 

The present invention also provides an antibody that specifically binds 
20 a MS4A polypeptide. The term "antibody" indicates an immunoglobulin 
protein, or functional portion thereof, including a polyclonal antibody, a 
monoclonal antibody, a chimeric antibody, a single chain antibody, Fab 
fragments, and an Fab expression library. "Functional portion" refers to the 
part of the protein that binds a molecule of interest. In a preferred 
25 embodiment, an antibody of the invention is a monoclonal antibody. 
Techniques for preparing and characterizing antibodies are well known in 
the art (See, e.g., Harlow & Lane (1988) Antibodies: A Laboratory Manual . 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York). A 
monoclonal antibody of the present invention can be readily prepared 
30 through use of well-known techniques such as the hybridoma techniques 
exemplified in U.S. Patent No 4,196,265 and the phage-displayed 
techniques disclosed in U.S. Patent No. 5.260,203. 
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The phrase "specifically (or selectively) binds to an antibody", or 
"specifically (or selectively) immunoreactive with", when referring to a protein 
or peptide, refers to a binding reaction which is determinative of the 
presence of the protein in a heterogeneous population of proteins and other 

5 biological materials. Thus, under designated immunoassay conditions, the 
specified antibodies bind to a particular protein and do not show significant 
binding to other proteins present in the sample. Specific binding to an 
antibody under such conditions can require an antibody that is selected for 
its specificity for a particular protein. For example, antibodies raised to a 

10 protein with an amino acid sequence encoded by any of the nucleic acid 
sequences of the invention can be selected to obtain antibodies specifically 
immunoreactive with that protein and not with unrelated proteins. 

The use of a molecular cloning approach to generate antibodies, 
particularly monoclonal antibodies, . and more particularly single chain 

15 monoclonal antibodies, are also provided. The production of single chain 
antibodies has been described in the art. See, e.g., U.S. Patent No. 
5,260,203. For this approach, combinatorial immunoglobulin phagemid 
libraries are prepared from RNA isolated from the spleen of the immunized 
animal, and phagemids expressing appropriate antibodies are selected by 

20 panning on endothelial tissue. The advantages of this approach over 
conventional hybridoma techniques are that approximately 10 4 times as 
many antibodies can be produced and screened in a single round, and that 
new specificities are generated by heavy (H) and light (L) chain 
combinations in a single chain, which further increases the chance of finding 

25 appropriate antibodies. Thus, an antibody of the present invention, or a 
"derivative" of an antibody of the present invention, pertains to a single 
polypeptide chain binding molecule which has binding specificity and affinity 
substantially similar to the binding specificity and affinity of the light and 
heavy chain aggregate variable region of an antibody described herein. 

30 The term "immunochemical reaction", as used herein, refers to any of 

a variety of immunoassay formats used to detect antibodies specifically 
bound to a particular protein, including but not limited to competitive and 
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non-competitive assay systems using techniques such as 
radioimmunoassays, ELISA (enzyme linked immunosorbent assay), 
"sandwich" immunoassays, immunoradiometric assays, gel diffusion 
precipitation reactions, immunodiffusion assays, in situ immunoassays (e.g., 
5 using colloidal gold, enzyme or radioisotope labels), western blots, 
precipitation reactions, agglutination assays (e.g., gel agglutination assays, 
hemagglutination assays), complement fixation assays, immunofluorescence 
assays, protein A assays, and immunoelectrophoresis assays, etc. See 
Harlow & Lane (1988) for a description of immunoassay formats and 

10 conditions. 

I.E. Protein Binding Assays 

The term "binding" refers to an affinity between two molecules, for 
example, a ligand and a receptor. As used herein, "binding" means a 
preferential binding of one molecule for another in a mixture of molecules. 

1 5 The binding of the molecules can be considered specific if the binding affinity 
is about 1 x 10 4 M" 1 to about 1 x 10 6 M" 1 or greater. Binding of two 
molecules also encompasses a quality or state of mutual action such that an 
activity of one protein or compound on another protein is inhibitory (in the 
case of an antagonist) or enhancing (in the case of an agonist). Exemplary 

20 protein binding assays include but are not limited to Fluorescence 
Correlation Spectroscopy (FCS), Surface-Enhanced Laser 
Desorption/lonization time-of-flight mass spectrometry (SELDI-TOF), and 
Biacore, each described further herein below. 

Fluorescence Correlation Spectroscopy (FCS) measures the average 

25 diffusion rate of a fluorescent molecule within a small sample volume 
(Madge et al. (1972) Phys Rev Lett 29:705-708; Maiti et al. (1997) Proc Natl 
Acad Sci USA, 94:11753-11757). The sample size can be as low as 10 3 
fluorescent molecules and the sample volume as low as the cytoplasm of a 
single bacterium. The diffusion rate is a function of the mass of the molecule 

30 and decreases as the mass increases. FCS can therefore be applied to 
protein-ligand interaction analysis by measuring the change in mass and 
therefore in diffusion rate of a molecule upon binding. In a typical 
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experiment, the target to be analyzed is expressed as a recombinant protein 
with a sequence tag, such as a poly-histidine sequence, inserted at the N- 
terminus or C-terminus. The expression takes place in E. coli, yeast or 
mammalian cells. The protein is purified using chromatographic methods. 

5 For example, the poly-histidine tag can be used to bind the expressed 
protein to a metal chelate column such as Ni 2+ chelated on iminodiacetic 
acid agarose. The protein is then labeled with a fluorescent tag such as 
carboxytetramethylrhodamine or BODIPY™ (Molecular Probes, Eugene, 
Oregon). The protein is then exposed in solution to the potential ligand, and 

10 its diffusion rate is determined by FCS using instrumentation available from 
Carl Zeiss, Inc. (Thomwood, New York). Ligand binding is determined by 
changes in the diffusion rate of the protein. 

Surface-Enhanced Laser Desorption/lonization (SELDI) was 
developed by Hutchens & Yip (1993) Rapid Commun Mass Spectrom 7:576- 

15 580). When coupled to a time-of-flight mass spectrometer (TOF), SELDI 
provides a means to rapidly analyze molecules retained on a chip. It can be 
applied to ligand-protein interaction analysis by covalently binding the target 
protein on the chip and analyzing by MS the small molecules that bind to this 
protein (Worrall et al. (1998) Anal Biochem 70:750-756). In a typical 

20 experiment, the target to be analyzed is expressed as described for FCS. 
The purified protein is then used in the assay without further preparation. It 
is bound to the SELDI chip either by utilizing the poly-histidine tag or by 
other interaction such as ion exchange or hydrophobic interaction. The chip 
thus prepared is then exposed to the potential ligand via, for example, a 

25 delivery system able to pipet the ligands in a sequential manner 
(autosampler). The chip is then washed in solutions of increasing 
stringency, for example a series of washes with buffer solutions containing 
an increasing ionic strength. After each wash, the bound material is 
analyzed by submitting the chip to SELDI-TOF. Ligands that specifically 

30 bind the target are identified by the stringency of the wash needed to elute 
them. 
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Biacore relies on changes in the refractive index at the surface layer 
upon binding of a ligand to a protein immobilized on the layer. In this 
system, a collection of smair ligands is injected sequentially in a 2-5 
microliter cell, wherein the protein is immobilized within the cell. Binding is 

5 detected by surface plasmon resonance (SPR) by recording laser light 
refracting from the surface. In general, the refractive index change for a 
given change of mass concentration at the surface layer is practically the 
same for all proteins and peptides, allowing a single method to be applicable 
for any protein (Liedberg et al. (1983) Sensors Actuators 4:299-304; 

10 Malmquist (1993) Nature 361:186-187). In a typical experiment, the target to 
be analyzed is expressed as described for FCS. The purified protein is then 
used in the assay without further preparation. It is bound to the Biacore chip 
either by utilizing the poly-histidine tag or by other interaction such as ion 
exchange or hydrophobic interaction. The chip thus prepared is then 

15 exposed to the potential ligand via the delivery system incorporated in the 
instruments sold by Biacore (Uppsala, Sweden) to pipet the ligands in a 
sequential manner (autosampler). The SPR signal on the chip is recorded 
and changes in the refractive index indicate an interaction between the 
immobilized target and the ligand. Analysis of the signal kinetics of on rate 

20 and off rate allows the discrimination between non-specific and specific 
interaction. 

I.F. Transgenic animals 

It is also within the scope of the present invention to prepare a 
transgenic animal to mutagenize the MS4A locus or to express a transgene 

25 comprising nucleic acid sequences of the present invention. The term 
"transgenic animal", indicates an animal comprising a germline insertion of a 
heterologous nucleic acid. Transgenic animals of the present invention are 
understood to encompass not only the end product of a transformation 
method, but also transgenic progeny thereof. 

30 The term "transgene", as used herein indicates a heterologous 

nucleic acid molecule that has been transformed into a host cell. For 
intended use in the creation of a transgenic animal, the transgene includes 
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genomic sequences of the host organism at a selected locus or site of 
transgene integration to mediate a homologous recombination event. A 
transgene further comprises nucleic acid sequences of interest, for example 
a targeted modification of the gene residing within the locus, a reporter gene, 
5 or a expression cassette, each defined herein above. 

Transgene integration can be used to create gene mutations, 
including "knock-out", "knock-in", or a "knock-down" mutations. 
Representative approaches are disclosed in the Examples presented below. 
The term "knock-out" refers to a homologous recombination event that 
10 renders a gene inactive. Gene knock-out is generally accomplished by 
integration of the transgene at a chromosomal loci, thereby interrupting a 
gene residing at that loci. The term "knock-in" refers to in vivo replacement 
at a targeted locus. Knock-in mutations can modify a gene sequence to 
create a loss-of-f unction or gain-of-fu notion mutation. The term "gene 
15 knock-down" refers to a homologous recombination event wherein the 
transgene partially eliminates gene function. A knock-down animal can be 
created by transgenic expression of an antisense molecule, wherein a 
transgene comprising the antisense sequence and a relevant promoter are 
integrated into the genome at a non-essential loci. Expression of the 
20 antisense or ribozyme molecule disrupts the corresponding gene function, 
although this disruption is generally incomplete (Luyckx et al. (1999) Proc 
Natl Acad Sci U SA 96(21):12174-12179). 

Conditional mutation can be accomplished using transgenic methods 
in combination with the Cre-recombinase system in mice. Briefly, in one 
25 instance, a transgenic mouse is derived that expresses Cre-recombinase 
under the direction of an inducible promoter. A second transgenic mouse 
bears a mutation of a gene of interest as well as a lox-P-flanked endogenous 
gene sequence. Such transgenic mice are mated, the resulting progeny 
having both the Cre-recombinase and lox-P-flanked transgenes. Induction 
30 of Cre recombinase catalyzes excision of the lox-P-flanked transgene, 
thereby excising a portion of the endogenous gene sequence and revealing 
the mutated sequence. Conditional knockout can be varied according to the 
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temporal and spatial features of Cre recombinase expression, inherent in the 
selection of a promoter to drive Cre recombinase. See Postic et al. (1999) J 
Biol Chem 275(1):305-315; and Sauer (1998) Methods 14(4):381-392. 

Transgenes can also be used for heterologous expression in a host 

5 organism without generating phenotypically apparent mutations. By this 
method, nucleotide sequences of interest are introduced into the genome at 
a nonessential loci, whereby insertion alone does not disrupt an essential 
gene function. Optionally, expression of the transgene can generate a gain- 
of-function or ectopic function phenotype. 

10 Techniques for the preparation of transgenic animals are known in the 

art. Exemplary techniques are described in U.S. Patent No. 5,489,742 
(transgenic rats); U.S. Patent Nos. 4,736,866, 5,550,316, 5,614,396, 
5,625,125 and 5,648,061 (transgenic mice); U.S. Patent No. 5,573,933 
(transgenic pigs); 5,162,215 (transgenic avian species) and U.S. Patent No. 

15 5,741,957 (transgenic bovine species). Briefly, nucleotide sequences of 
interest are cloned into a vector, and the construct is transformed into a 
germ cell. In the germ cell, a chromosomal rearrangement event takes 
place wherein the nucleic acid sequences of interest are integrated into the 
genome of the germ cell by homologous recombination. Fertilization and 

20 propagation of the transformed germ cell results in a transgenic animal. 
Homozygosity of the mutation is accomplished by intercrossing. 
I.G. Therapeutic Methods 

The present invention further provides methods for discovering 
substances that can be used as pharmaceutical compositions. The term 
25 "pharmaceutical composition" or "drug" as used herein, each refer to any 
substance having a biological activity. Substances discovered by methods 
of the present invention include but are not limited to polypeptide, proteins, 
peptides, chemical compounds, and antibodies. 

A composition of the present invention is typically formulated using 
30 acceptable vehicles, adjuvants, and carriers as desired. 

Among the acceptable vehicles and solvents that can be employed 
are water, Ringers solution, and isotonic sodium chloride solution. In 
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addition, sterile, fixed oils are conventionally employed as a solvent or 
suspending medium. For this purpose any bland fixed oil can be employed 
including synthetic mono- or di-glycerides. In addition, fatty acids such as 
oleic acid find use in the preparation of injectable compositions. 

5 Injectable preparations, for example sterile injectable aqueous or 

oleaginous suspensions, are formulated according to the known art using 
suitable dispersing or wetting agents and suspending agents. The sterile 
injectable preparation can also be a sterile injectable solution or suspension 
in a nontoxic diluent or solvent, for example 1 ,3-butanediol. 

10 A vector can be used as a carrier, for example an adenovirus vector, 

can be used for gene therapy methods. The vector is purified to sufficiently 
render it essentially free of undesirable contaminants, such as defective 
interfering adenovirus particles or endotoxins and other pyrogens such that it 
does not cause any untoward reactions in the individual receiving the vector 

15 construct. A preferred means of purifying the vector involves the use of 
buoyant density gradients, such as cesium chloride gradient centrifugation. 

A transfected cell can also serve as a carrier. By way of example, a 
liver cell can be removed from an organism, transfected with a nucleic acid 
sequence of the present invention using methods set forth above and then 

20 the transfected cell returned to the organism (e.g. injected intra-vascularly). 

Monoclonal antibodies or polypeptides of the invention can be 
administered parenterally by injection or by gradual infusion over time. 
Although the tissue to be treated can typically be accessed in the body by 
systemic administration and therefore most often treated by intravenous 

25 administration of therapeutic compositions, other tissues and delivery means 
are provided where there is a likelihood that the tissue targeted contains the 
target molecule and are known to those of skill in the art. 

Representative antibodies for use in the present invention are intact 
immunoglobulin molecules, substantially intact immunoglobulin molecules, 

30 single chain immunoglobulins or antibodies, those portions of an 
immunoglobulin molecule that contain the paratope, including antibody 
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fragments. It is within the scope of the present invention that a monovalent 
modulator can optionally be used. 

Methods of preparing "humanized" antibodies are generally well 
known in the art, and can readily be applied to the antibodies of the present 
5 invention. Humanized monoclonal antibodies offer particular advantages 
over monoclonal antibodies derived from other mammals, particularly insofar 
as they can be used therapeutically in humans. Specifically, humanized 
antibodies are not cleared from the circulation as rapidly as "foreign" 
antigens, and do not activate the immune system in the same manner as 
1 0 foreign antigens and foreign antibodies. 

With respect to the therapeutic methods of the present invention, a 
preferred subject is a vertebrate subject. A preferred vertebrate is warm- 
blooded; a preferred warm-blooded vertebrate is a mammal. A preferred 
mammal is a mouse or, most preferably, a human. As used herein and in 
15 the claims, the term "patient" includes both human and animal patients. 
Thus, veterinary therapeutic uses are provided in accordance with the 
present invention. 

Also provided is the treatment of mammals such as humans, as well 
as those mammals of importance due to being endangered, such as 
20 Siberian tigers; of economical importance, such as animals raised on farms 
for consumption by humans; and/or animals of social importance to humans, 
such as animals kept as pets or in zoos. Examples of such animals include 
but are not limited to: carnivores such as cats and dogs; swine, including 
pigs, hogs, and wild boars; ruminants and/or ungulates such as cattle, oxen, 
25 sheep, giraffes, deer, goats, bison, and camels; and horses. Also provided 
is the treatment of birds, including the treatment of those kinds of birds that 
are endangered and/or kept in zoos, as well as fowl, and more particularly 
domesticated fowl, i.e., poultry, such as turkeys, chickens, ducks, geese, 
guinea fowl, and the like, as they are also of economical importance to 
30 humans. Thus, provided is the treatment of livestock, including, but not 
limited to, domesticated swine, ruminants, ungulates, horses, poultry, and 
the like. 
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As used herein, the term "experimental subject" refers to any subject 
or sample in which the desired measurement is unknown. The term "control 
subject" refers to any subject or sample in which a desired measure is 
unknown. 

5 As used herein, an "effective" dose refers to a dose(s) administered to 

an individual patient sufficient to cause a change in MS4A activity. One of 
ordinary skill in the art can tailor the dosages to an individual patient, taking 
into account the particular formulation and method of administration to be 
used with the composition as well as patient height, weight, severity of 

10 symptoms, and stage of the biological condition to be treated. Such 
adjustments or variations, as well as evaluation of when and how to make 
such adjustments or variations, are well known to those of ordinary skill in 
the art of medicine. 

A therapeutically effective amount can comprise a range of amounts. 

15 One skilled in the art can readily assess the potency and efficacy of a MS4A 
modulator of this invention and adjust the therapeutic regimen accordingly. 
A modulator of MS4A biological activity can be evaluated by a variety of 
means including the use of a responsive reporter gene, interaction of MS4A 
polypeptides with a monoclonal antibody, analysis of cell subpopulations, 

20 and measurement of [Ca 2+ ]j levels, each technique described herein. 

Additional formulation and dose techniques have been described in 
the art, see for example, those described in U.S. Patent Nos. 5,326,902 and 
5,234,933, and International Publication No. WO 93/25521. 

For the purposes described above, the identified substances can 

25 normally be administered systemically, parenterally, or orally. The term 
"parenteral" as used herein includes intravenous, intra-muscular, intra- 
arterial injection, or infusion techniques. Other compositions for 
administration include liquids for external use, and endermic liniments 
(ointment, etc.), suppositories, and pessaries which comprise one or more of 

30 the active substance(s) and can be prepared by known methods. 
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HA Identification of CD20 Gene Family Members 
The present invention provides MS4A nucleic acid and polypeptide 
sequences. Preferably, a MS4A gene comprises the sequence set forth as 
5 any one of the odd-numbered SEQ ID NOs:1-37, a nucleic acid molecule 
that is substantially similar to any one of the odd-numbered SEQ ID NOs:1- 
37, or a nucleic acid molecule comprising a 20 base pair nucleotide 
sequence that is identical to a contiguous 20 base pair sequence of any one 
of the odd-numbered SEQ ID NOs:1-37. 
10 To identify new CD20 gene family members, the human and mouse 

CD20 amino acid sequences (Tedder et al., 1988a; Tedder et al., 1988b) 
were used to search the translated GenBank databases, including 
expressed sequence tags, using the BLAST program (Altschul et al., 1997). 
Among 337 homologous sequences identified, at least 17 novel genes 
15 expressed by mouse, human, and pig had predicted amino acid sequences 
homologous to CD20. Complete coding regions were predicted using 
overlapping nucleotide sequences obtained from sequenced ESTs and 
cDNAs that corresponded to unique, near full-length transcripts in humans 
and mice (Figure 1). All nucleotide sequences were verified by sequencing 
20 multiple near full-length cDNAs isolated by applicants and 40 cDNAs 
obtained from the ATCC (American Tissue Culture Collection, Bethesda, 
Maryland, USA). In addition, a pig cDNA and its human counterpart 
homologous to CD20 were identified as GenBank submissions AJ236932.1 
and AK000224, respectively. In total, unique cDNA clones were identified 
25 that encode at least 1 6 distinct full-length CD20-like proteins. 

Complete cDNA sequences encoding the human and mouse MS4A 
family members (MS4A1, -A2, -A3, -AAA, -A5, -A6A, -Al, -A8B and -A12) 
were also used to search the GenBank human genomic database (htgs; 
http://www.ncbi.nlm.nih.gov/blast/) using the BLAST program (Altschul et al., 
30 1997), as further described in Example 2. Two-hundred-twenty different 
contigs or distinct genomic DNA sequences were identified in the database 
of unfinished human genomic sequences that were either identical or similar 
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to MS4A family members. These sequences were predominantly derived 
from sixteen partially sequenced bacterial artificial chromosomes (BACs) 
that spanned 400-500 kb of human chromosome 11q12 (Table 1). Based on 
known cDNA sequences of MS4A family members, we were able to order 

5 and arrange these genomic sequences into overlapping continuous DNA 
segments. Since many of the contigs identified were overlapping, it was 
thereby possible to assemble long DNA sequences that encoded entire 
MS4A genes or portions of MS4A genes. Gaps between exon encoding 
DNA sequences were filled in many cases by additional sequence homology 

10 searches using DNA sequences found at the ends of gaps. When sequence 
differences were observed between different overlapping DNA fragments, 
consensus sequences were used or PCR primers were generated, that 
portion of genomic DNA was then amplified and sequenced to resolve 
ambiguous sequences. 

15 BLAST analysis of the htgs phase 1 or phase 2 human genomic DNA 

sequences encoding MS4A cDNAs and the assembled and annotated 
human genomic sequence thereof, as disclosed herein, revealed the 
presence of each known human MS4A family member. In addition, three 
putative genes encoding unique MS4A family members were identified that 

20 localized to the q1 2-1 3.1 region of human chromosome 11. Complete 
coding regions were predicted using overlapping nucleotide sequences 
obtained from sequenced ESTs and cDNAs and by comparison of gene 
structure, described further herein below (Figure 2). 

By identifying sequences that correlated with different MS4A genes in 

25 each BAC (Table 1), and by the assembly of minimal genomic DNA lengths 
that could encode each MS4A gene (Figure 2), we used the overlapping 
BACs to identify the order of the MS4A genes on chromosome 11q12 
(Figure 6). This analysis also allowed us to determine the direction of gene 
transcription for most MS4A genes. Furthermore, the MS4A cDNA 

30 sequences, disclosed herein, were used to assemble genomic clones set 
forth as SEQ ID NOs:73-81. In some cases, multiple MS4A genes could be 
aligned within a continuous genomic sequence. For example, the genomic 
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sequence set forth as SEQ ID NO:77 comprises both the MS4A4E and 
MS4A6A genes. Similarly, the genomic region set forth as SEQ ID NO:79 
comprises three MS4A genes: MS4A7, MS4A5, and MS4A12. 

The MS4A4E gene encodes 660 bp of translated sequence (Figure 

5 3), contained within at least seven exons (Figure 2). Exons were identified 
based on their sequence similarities with MS4A4A sequences and the 
identification of canonical splice-donor and -acceptor sites (Aebi & 
Weissmann, 1 987). The MS4A4E gene sequence was at least 23,379 base 
pairs in length, if counted from the putative translation initiation ATG site until 

10 the TGA translation termination stop site (Figure 2). An exon encoding the 
putative 5* untranslated region of MS4A4E, was highly homologous with the 
corresponding sequence in MS4A4A cDNAs (disclosed herein). This 
sequence homology extended for >7 kbp upstream from this putative exon 
and also included upstream repetitive Alu elements. Representative 

15 upstream homologous sequences are shown in Figure 3. Similar sequence 
homologies were identified in the 3' untranslated regions of MS4A4E and 
MS4A4A, which extended beyond the poly-A attachment signal sequences 
(Figure 3). Based on the sequence similarities in translated and 
untranslated exons, it appears that the MS4A4E and MS4A4A genes 

20 resulted from a recent gene duplication event. 

The MS4A6E gene encodes 441 bp of translated sequence (Figure 
4), contained within at least four exons (Figure 2). Exons were identified 
based on their sequence similarities with MS4A6A cDNA sequences and the 
identification of canonical splice-donor and -acceptor sites (Aebi & 

25 Weissmann, 1987). In addition, the predicted gene sequences matched 
those found in three cDNA clones that were sequenced (ATCC Nos. 
3704466, 1852248 and 3557769). The MS4A6E gene was at least 5,060 bp 
in length, if counted from the putative translation initiation ATG site until the 
TGA translation termination codon (Figure 2). The MS4A6E gene lacks 

30 exons that encode the first two membrane spanning domains present in 
most MS4A family proteins (Figures 2 and 7). An exon homologous with the 
5' untranslated region of MS4A6A cDNAs was not identified within 7,629 bp 
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of sequence upstream of the exon encoding the translation initiation site of 
MS4A6E. However, there was a canonical 3' splice region upstream of the 
ATG initiation codon located at identical positions in the MS4A6E and 
MS4A6A genes. Similar sequence homologies were identified in the 3' 

5 untranslated regions of MS4A6E and MS4A6A that extend beyond the 
sequence shown in Figure 4. Based on the sequence similarities in 
translated and untranslated exons, it appears that the MS4A6E and 
MS4A6A genes represent a recent gene duplication event, although several 
exons encoding translated sequence were lost in the MS4A6E gene (Figure 

10 2). 

The MS4A10 gene encodes 726 bp of translated sequence (Figure 
5), contained within at least six exons (Figure 2). Exons were identified 
based on their sequence similarities with mouse MS4a10 cDNA sequences 
and the identification of canonical splice-donor and -acceptor sites (Aebi & 

15 Weissmann, 1987). The MS4A10 gene was at least 8,183 bp in length if 
counted from the putative translation initiation ATG site until the TGA 
translation termination stop site (Figure 2). An exon homologous with the 5' 
untranslated region of mouse MS4a10 cDNAs was not identified within 8,829 
bp of sequence upstream of the exon encoding the translation initiation site 

20 of MS4A10. However, there was a canonical 3' splice region upstream of 
the ATG initiation codon located at identical positions in the MS4A10 and 
MS4a10 genes. Modest sequence homologies were identified in the 3' 
untranslated regions of MS4A10 and MS4a10 (Figure 5). 
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Table 1 

Human BACs Containing MS4A Genes 



DAP 
bAU 


Accession No. 3 


Chromosome 


MS4A gene 0 


RP11-206B10 


AC009703 


15 


A4A, A4E, A6A 


Kr 1 l-z id ih 


AC013807 


unknown 


A6A, A2, A3 


RP11-24D1 


AC015840 


unknown 


A4A, A5, A6E, A7 




AC0 18966 


11 


A4A, A4E, A6A 


RP11-44fiIM3 


AC024066 


11 


A8B 


Krll-o i^pJ 1 f 


AC027599 


11 


A8B, A10 


Kr 1 1- lyDt ID 


AC027787 


15 


A5, A1 


pfcjinQ 7QR9 


AP000748 

Til WW/ » ' ^ 


11q23 


A10 


RP1 1-804 A23 


AP000777 


11 


A10 


RP1 1-736110 


AP000790 


11q12 


A3 


RP11-804B24 


ArUUUyo** 


11 


A10 


RP11-729B4 


AP001034 


11q12 


A5, A12, A1 


CMB9-2M23 


AP001181 


11q12 


A2.A3 


CMB9-100I1 


AP001257 


11q12 


A6A, A4E 


CMB9-49F18 


AP001259 


11 


A8B 


RP11-68H20 


AP001986 


11q 


A10 



a GenBank Accession number for the indicated BAC. 
5 indicates the MS4A gene sequences that mapped to each 
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11. B. MS4A Nomenclature 

In collaboration with the Human Gene Nomenclature Committee 
( www.Qene.ucl.ac.uk/nomenclature/) , this gene family was designated as the 
MS4A family (Membrane Spanning 4-domain family, subfamily A). The MS4 

5 designation is to accommodate the future identification of genes encoding 
proteins with a similar structure, yet with unresolved functions. Subfamily A 
will designate the CD20 family. Using this nomenclature, the CD20 gene 
was designated as MS4A1, FceRIp as MS4A2, and HTm4 as MS4A3. 
Among the 16 novel genes identified, 8 human genes were named MS4A4A, 

10 MS4A4E, MS4A5, MS4A6A, MS4A6E, MS4A7, MS4A8B, and MS4A12. A 
ninth gene encoded a protein homologous with the single member of the 
mouse MS4a10 subfamily. This gene was tentatively designated as 
MS4A10. The remaining genes were of mouse or pig origin and were 
therefore labeled as MS4a3-MS4a12 based on the nomenclature of 

15 homologous genes corresponding to human counterparts. Distinct mouse 
genes that encoded proteins with highly homologous sequences were 
designated as MS4a4B, MS4a4C, MS4a4D, and as MS4a6B, MS4a6C, and 
MS4a6D to signify close homology. 

II.C. MS4A Gene Chromosome Locations 

20 Chromosome locations for the human MS4A4A, MS4A6A, MS4A7, 

and MS4A8B genes were identified in two distinct homology searches. 
Regions of human MS4A4A (bp 1286-1588), MS4A6A (bp 682-1106), 
MS4A7 (bp 502-941), MS4A7 (bp 1015-1177), and MS4A8B (bp 1007- 
1350), were 98%, 98%, 97%, 99% and 97% identical with human STS 

25 genomic sequence tag sites, WM1578, SHGC-36634, WI-12101, WIAF- 
3856, and WI-14145, respectively (http://www.ncbi.nlm.nih.qov/blast) . 
These genomic sequence tag sites are located on human chromosome 1 1 at 
Genomic Database locus D11S1357-D11S913, which maps to 11q12-13 
(http://www.ncbi.nlm.nih.gov/qenemap) . These mapping results were 

30 confirmed using the UniGene collection at the National Center for 
Biotechnology Information (http://www.ncbi.nlm.nih.gov/Genemap98/) for 
expressed sequence tags identical to human MS4A4A, MS4A6A, MS4A7, 
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MS4A8B sequences. By this analysis, at least 7 of the 9 currently identified 
human MS4A genes are clustered. 

The organization of the 12 MS4A genes on human chromosome 11 
was determined by identifying sequenced human genomic DNA fragments 
5 (contigs of different lengths) from 15 BAC clones (Table 1). Contiguous 
DNA segments for each BAC were constructed based on human MS4A 
exon and cDNA sequences, and overlapping contigs. Although some gaps 
were present in MS4A gene introns (Figure 2) or between MS4A genes, the 
relative position of each gene on chromosome 11q12-13.1 was determined 
10 (Figure 6). MS4A1 was located in a telemetric region of 11q12-13.1 
compared with MS4A2 and MS4A3. Seven MS4A genes were located in 
between MS4A1 and MS4A2. Two other MS4A genes, MS4A8B and 
MS4A10 were centromeric to MS4A2 and MS4A3, although the distance 
between these genes was not determined. Interestingly, MS4A6A, 
15 MS4A4E, MS4A4A and MS4A6E were arranged linearly suggesting that 
these genes might have arisen through the duplication of a single genomic 
element. It is envisioned that this genetic locus extends further and contains 
additional MS4A genes. 

1 1 .A. MS4A Gene Structure 
20 Complete coding region sequences were verified for each deduced 

protein, except for the MS4a3 cDNA that was not full-length (Figure 1). 
Proposed ATG translation initiation codons were based on the translation 
initiation consensus sequence, ANN ATG (Kozak (1986) Cell 44:283-292), 
and the existence of in-frame upstream translation stop codons in most 
25 cases. Whether the first or second ATG codon in mouse MS4a8B was used 
for translation initiation was unknown although the second ATG was identical 
with the start codon of human MS4A8B (Figure 7). 

Poly(A) attachment signal sequences were identified in the proximal 
3' untranslated regions of each gene product except MS4A6A, MS4A6E, 
30 MS4A10, and MS4a6C. Two poly(A) signal sequences were found in 
MS4a4D, MS4A5, and MS4a10 transcripts, while four were observed in 
MS4A4A transcripts. 
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The disclosed MS4A cDNAs were further used to annotate the 
genomic sequence derived from BAC clones. Annotated features include 
definition of coding regions, intron|exon junctions, sequences upstream of 
the initial coding region of each gene that comprise the promoter region, and 
5 other adjacent sequences that could also comprise gene regulatory 
elements. Representative methods for further characterizing a MS4A 
promoter region are disclosed in Example 9. 

Annotation of human MS4A genomic regions (SEQ ID NOs:73-81), as 
disclosed herein, enabled a comparison of gene structure among MS4A 
10 genes. The overall domain organization of each MS4A gene was similar 
(Figures 2 and 7). All exon|intron|exon boundaries were consistent with 
consensus splice-donor and -acceptor sequences unless otherwise 
indicated, with exon|GTGAGT-intron-CAG|exon sequences in most cases 
(Aebi & Weissmann, 1987). In addition, the splice junctions for all translated 
15 exons were located after the third nucleotide in each codon. Most MS4A 
proteins were encoded by 6 exons except MS4A2, MS4A5, and MS4A6E 
(Figure 2 and 7). In these exceptions: the N-terminal cytoplasmic domain of 
MS4A2 was encoded by two exons (Kuster et al., 1992); the MS4A5 and 
MS4A6E genes did not encode C-terminal cytoplasmic domains; and the 
20 MS4A6E gene had only two membrane spanning domains. Intron lengths 
demonstrated wide variation from 181 bp in MS4A12 to 13,731 bp in 
MS4A5. In some cases however, exact intron lengths were not determined; 
MS4A3, MS4A4, and MS4A12 (Figure 2). Distances between translation 
initiation and termination codons were determined for most MS4A genes; 
25 with MS4A6E being the smallest (5,060 bp) and MS4A4E being the longest 
(23,379 bp) genes (Figure 6). Thus, the intron|exon organization of all 
MS4A family members is consistent with the high degree of conservation 
within this gene family. 

There were no amino-terminal signal sequences, although all MS4A 
30 proteins contained hydrophobic regions of sufficient length to pass through 
the membrane at least four times. Notable was a marked clustering of 
charged residues at both ends of the putative transmembrane domains, 



WO 02/062946 



48 



PCT/US01/48437 



some of which were highly conserved. In some cases, the first and second 
putative transmembrane domains of MS4A proteins were a continuous 
stretch of hydrophobic amino acids without an obvious inter-transmembrane 
hydrophilic bridge. By contrast, MS4A4A and MS4A7 had 6 to 7 hydrophilic 
5 amino acids inserted between the first and second hydrophobic domains. In 
human MS4A4A and mouse MS4a4B, MS4a4C, and MS4a4D, an extensive 
hydrophobic region followed the fourth putative membrane-spanning 
domain. Thus, the overall structure of MS4A family members was well 
conserved. 

10 UJL MS4A Gene Splice Variants 

Among the MS4A cDNAs sequenced and EST sequences analyzed, 
multiple splice variants were identified that encoded variant MS4A proteins. 
In most cases, exons were spliced out, which generated truncated protein 
products. Potential splice variants of the MS4A4A, MS4A5, MS4A6A, and 

15 MS4A7 genes were identified. Whether these alternatively spliced variants 
produce functional proteins has yet to be determined. 

Two splice variations of the MS4A4A gene were identified during an 
analysis of MS4A4A mRNA expression by lymphoblastoid cell lines. Most of 
the hematopoietic cell lines examined expressed transcripts encoding a full- 

20 length MS4A4A protein as shown in Figure 7. However, a second smaller 
transcript was also expressed in most cases that contained a potential exon 
deletion of 158 nucleotides. This was a frequent event since 40% of 
MS4A4A cDNAs generated from the BJAB B cell line encoded the truncated 
protein. In addition, the same splicing event was observed in two of five 

25 EST sequences that covered this region of the MS4A4A protein. Splicing- 
out this potential exon deleted the third membrane-spanning domain and the 
second extracellular loop from the full-length protein (positions 110-163, 
Figure 3). Of interest, this splicing event fused the first/second membrane 
spanning domains with the fourth membrane spanning domain. However, 

30 the fourth transmembrane spanning domain in MS4A4A is followed by 
another hydrophobic region of sufficient length to traverse the membrane 
(disclosed herein). This suggests that differential splicing can generate an 
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alternative MS4A4A protein with four membrane spanning domains lacking a 
significant extracellular domain. 

In the case of the MS4A5 gene, two of nine MS4A5 EST sequences 
analyzed (GenBank Accession Nos. AA411806 and AA781801) encoded a 
5 splice variant that preserved the reading frame of the transcript. In both 
sequences, the exon encoding the third membrane-spanning domain and 
the second extracellular loop from the full-length protein (TM3, Figure 1) was 
spliced out using normal splice-donor and -acceptor sequences, which 
deleted 51 amino acids (114-164) from the full length protein (Figure 7). 

10 This deletion resulted in a protein with the first/second membrane spanning 
domains fused with the fourth predicted membrane-spanning domain. Thus, 
the truncated MS4A5 protein would possess three membrane-spanning 
domains with an extracellular carboxyl-terminal domain. 

A novel splicing event was observed in the MS4A6A gene which 

1 5 resulted in a truncated protein. A novel splice donor site (CAG T 683 |GT GAG 
T) is located within the exon encoding the TM3/extracellular loop domains 
(Figure 4). This cryptic splice donor site was spliced with the normal 3' 
splice acceptor site of the exon encoding the TM4 domain, which thereby 
deletes nucleotides 684-787 from MS4A6A transcripts (Figure 4). Since 

20 there was an extra T introduced into the codon sequence due to this 
alternative splicing event, there was a frameshift in the coding sequence. 
This potentially results in the attachment of a novel 30 amino acid sequence 
(-WNSLSDADLHSAGILPSCAHCCAAVETGLL) that is not predicted to be 
hydrophobic. Thus, the variant MS4A protein would be 70 amino acids 

25 shorter and would lack the fourth membrane-spanning and cytoplasmic 
domains. This alternative splicing event was found in 3 of 29 EST 
sequences that encoded this region (GenBank Accession Nos. AI278475, 
AA461046, and AA448335) and in one cDNA clone (GenBank Accession 
No. AB013104). 

30 Splice variation in MS4A7A transcripts produces two distinct protein 

products in addition to the presumably normal protein. In one case, a splice 
variation in MS4A7A transcripts produces a protein product similar in 
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structure to the MS4A6E protein. The exon encoding the first/second 
membrane spanning domains (amino acids 50-94, Figure 7) was deleted in 
2 of 4 MS4A7 EST sequences analyzed (GenBank Accession Nos. N42191 
and R1 1 179) that cover this region. Thus, the protein product would have a 
5 longer N-terminal cytoplasmic domain and only two membrane spanning 
domains. In the second case, the exon encoding the fourth membrane- 
spanning domain (amino acids 183-216) was deleted in 2 EST sequences 
(GenBank Accession Nos. R11180 and AI188478) out of 18 sequences 
analyzed (Figure 7). 
10 ILR MS4A Gene Polymorphisms 

Putative polymorphisms were identified in the MS4A6A gene. Two 
nucleotide substitutions were found in cDNA clone ATCC No. 499181 and in 
13 of 38 EST sequences analyzed (Figure 1). The first substitution was at 
nucleotide 373 that exchanged a C for a T, which did not alter the amino 
15 acid sequence. The second substitution resulted in a Ser in place of Thr at 
amino acid 185. In addition, a third substitution was found in 4 of the 38 
EST sequences analyzed where a Ser was substituted in place of an Ala at 
amino acid position 183. This substitution was paired with a Ser to Thr 
substitution at amino acid position 185 in half of the clones analyzed. These 
20 differences most likely represent common sequence polymorphisms since 
they were observed in multiple independent cDNA clones. Based on our 
genetic DNA analysis, it is unlikely that these differences could represent 
transcripts from distinct genes that are almost identical in coding sequence. 
As with the MS4A6A gene (disclosed herein), potential gene 
25 polymorphisms were observed in MS4A6E. Three cDNA clones 
representing partial transcripts were sequenced completely on both strands. 
The predicted MS4A6E gene product and one cDNA clone (ATCC No. 
3704466) had identical sequences. However, the ATCC No. 3557769 cDNA 
had a nucleotide substitution at position 314 (Figure 4) that exchanged a T 
30 for a C, which did not alter the predicted amino acid sequence. The ATCC 
No.1 852248 cDNA clone had the longest insert that starts at nucleotide 
position 60 and ended at position 661 as shown in Figure 4. This cDNA had 
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a substitution at nucleotide 153 that exchanged a G for a T, which resulted in 
a Phe in place of Val at amino acid 47 (Figure 4). Therefore, sequence 
polymorphisms can exist within the MS4A6E gene. 

Other potential polymorphisms were observed in other MS4A family 
5 members based on consistent nucleotide variations found in MS4A4E 
sequences. 

The assembly and annotation of genomic sequences comprising 
MS4A genes in the region of human chromosome 11q12-13.1, disclosed 
herein for the first time, provide source material for identification of 

10 polymorphisms that are linked to MS4A genes. Such polymorphisms can 
include single nucleotide polymorphisms as disclosed within the MS4A6A 
and MS4A6E coding region sequences. In addition, polymorphisms within 
or genetically linked to MS4A genes can also comprise restriction length 
polymorphisms (RFLPs) (Lander & Botstein (1989) Genetics 121:185-199), 

15 short tandem repeat polymorphisms (STRPs), short sequence length 
polymorphisms (SSLPs) (Dietrich et at. (1996) Nature 380:149-152), 
amplified fragment length polymorphisms (AFLPs) (Latorra et al. (1994) PCR 
Methods Appl 3(6):351-358), and microsatellite markers (Schalkwyk et al. 
(1999) Genome Res 9:878-887). Identification of polymorphisms within an 

20 isolated DNA molecule are known to one of skill in the art. 
II. G. MS4A Proteins 

The MS4A genes encoded proteins of 16-29 kDa (Table 2). 
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Table 2 
MS4A Family Members 



Human 


Mouse 


Human/Mouse 


Name 


kDa 


Name 


kDa 


Hnmnloav 

I iui i njiv/y y 






MS4a3 




63% (partial) 


MS4A4A 


23 












Ms4a4B 


24 


41% 






Ms4a4C 


24 


44% 






Ms4a4D 


24 


40% 


MS4A4E 


24 








MS4A5 


22 








MS4A6A 


27 












Ms4a6B 


27 


52% 






Ms4a6C 


24 


51% 






Ms4a6D 


26 


coo/ 
0.3% 


MS4A6E 


16 








MS4A7 


26 


MS4a7 


26 


53% 


MS4A8B 


26 


MS4a8B 


29 


63% 


MS4A10 


27 


MS4a10 


29 


52% 


MS4A12 


26 


MS4a12( P ig) 


26 


60% 



Predicted molecular weights for the new MS4A family members and the 
5 percentage amino acid sequence identity between deduced MS4A and 
MS4a proteins. 
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Comparisons between CD20 and the predicted amino acid 
sequences for human MS4A4A, MS4A5, MS4A6A, MS4A7, MS4A8B, and 
MS4A12 revealed 23-29% amino acid sequence identity (Figure 7). The 
highest degree of identity was found in the first three transmembrane 

5 domains with multiple regions of conserved amino acids. In particular, the 
amino acid sequences LGAXQI (SEQ ID NO:57) and LSLG (SEQ ID NO:58) 
were common within the first transmembrane domain, GYPFWG (SEQ ID 
NO:60) and FIISGSLS (SEQ ID NO:61) were common in the second 
domain, and SLX2NX2SX3AX2G (SEQ ID NO:62) was found in the third 

10 transmembrane domain, the first and second transmembrane domains of 
MS4A8B were 46% identical in amino acid sequence with human CD20, 
41% identical with FceRip, and 39% identical with HTm4. The MS4A4A, 
MS4A5, MS4A6A, and MS4A7 proteins were most homologous in their first 
and second transmembrane domains with the human FceRip chain, with 37- 

15 46% amino acid sequence identity. There was large variation between 
MS4A proteins in the N- and C-terminal cytoplasmic domains. However, 
Pro residues were significantly over-represented within the N- and C- 
terminal cytoplasmic domains of most MS4A family members. There was 
some sequence identity in the first potential extracellular loop that was -13 

20 amino acids in length for each protein. By contrast, the second predicted 
extracellular loop ranged from 10-46 amino acids in length with diverse 
sequences. 

The putative MS4A4E gene encodes a 220 amino acid protein of 23.8 
kDa with a predicted amino acid sequence that is 76% identical with the 

25 MS4A4A protein (Figure 3). Consistent with other MS4A proteins, the most 
significant homologies between MS4A4E and other MS4A family members 
were found in the membrane spanning domains (Figure 7). Common amino 
acid motifs were readily visualized such as KXLGAIQI (SEQ ID NO:57), 
GYPXWG (SEQ ID NO:60), and SGXLSI (SEQ ID NO:59) in the first and 

30 second hydrophobic regions that represent potential transmembrane 
regions. The intracellular N- and C-terminal domains were highly conserved 
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between MS4A4E and MS4A4A, but were divergent from other family 
members. 

The putative MS4A6E gene encodes a 147 amino acid protein of 15.9 
kDa with a predicted amino acid sequence that is 78% identical with the 

5 MS4A6A protein (Figure 4). The most significant homologies between 
MS4A6E and other MS4A family members were found in the membrane 
spanning domains, although MS4A6E only had two (TM3 and TM4) 
membrane-spanning domains (Figures 4 and 7). The putative second 
extracellular loops of MS4A6E and MS4A6A were of identical length (Figure 

10 4). Common amino acid motifs were readily visualized in the hydrophobic 
regions that represent potential transmembrane regions. The intracellular N- 
terminal domain was highly conserved between MS4A6E and MS4A6A, but 
were divergent from other family members. MS4A6E protein also lacks a C- 
terminal cytoplasmic domain (Figure 4). 

15 The putative MS4A10 gene encodes a translated 241 amino acid 

protein of 26.9 kDa with a predicted amino acid sequence that is 52% 
identical with the mouse MS4a10 protein (Figure 5). The most significant 
homologies between MS4A10 and MS4a10 were found in the membrane 
spanning domains and the putative second extracellular loop (Figure 5). 

20 Although the N-terminal cytoplasmic domains of MS4A10 and MS4a10 were 
of similar length, the intracellular N- and C-terminal domains had the lowest 
sequence homologies among domains. The cytoplasmic C-terminal domain 
was 28 amino acids shorter in MS4A10 than MS4a10. Nonetheless, based 
on the sequence similarities of translated regions, it appears that MS4A10 

25 and MS4a10 represent homologous genes that are more similar to one 
another than other MS4A family members. 

Ten novel mouse MS4A proteins were identified that shared 40-63% 
amino acid sequence identity with their potential human counterparts (Figure 
7, Table 2). For comparison, the mouse and human CD20 proteins are 74% 

30 identical in amino acid sequence (Tedder et al., 1988a). A single partial 
cDNA was identified that encoded the mouse homologue for HTm4 (MS4a3, 
Figure 7). The predicted amino terminus of the proposed MS4a3 protein 
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was 23 amino acids shorter than in the human protein, although their 
overlapping regions were 63% identical in amino acid sequence. In all 
cases, the transmembrane domains of the human and mouse MS4A 
proteins were the most well conserved regions. For example, the human 

5 MS4A8B protein was 78% identical in sequence to MS4a8B in the first 3 
transmembrane domains and 68% identical in domain 4. Additional MS4A 
genes are likely to be identified in humans and mice, including the mouse 
MS4A5 homologue. 

A UPGMA (unweighted pair group method using arithmetic averages) 

10 tree showing relatedness of deduced MS4A and MS4a protein sequences is 
depicted in Figure 8. 

ML Methods for Detecting a MS4A Nucleic Acid Molecule 

In another aspect of the invention, a method is provided for detecting 

15 a nucleic acid molecule that encodes a MS4A polypeptide. According to the 
method, a biological sample having nucleic acid material is procured and 
hybridized under stringent hybridization conditions to a MS4A nucleic acid 
molecule of the present invention. Such hybridization enables a nucleic acid 
molecule of the biological sample and the MS4A nucleic acid molecule to 

20 form a detectable duplex structure. Preferably, the MS4A nucleic acid 
molecule includes some or all nucleotides of any one of the odd-numbered 
SEQ ID NOs:1-37. Also preferably, the biological sample comprises human 
nucleic acid material. 

HLA. Expression of MS4A Family Members in Hematopoietic Cells 

25 Since CD20, FceRip, and HTm4 expression are restricted to 

hematopoietic tissues, MS4A gene transcription was assessed by PCR 
amplification of cDNA from eleven human hematopoietic cell lines. Like 
CD20, MS4A8B was only expressed by B cell lines (Table 3). MS4A5 was 
only expressed by a promonocyte cell line. MS4A6A transcripts were 

30 expressed by B cell, myelomonocytic, and erythroleukemia cell lines. 
MS4A4A mRNA was expressed by all cell lines examined, although the 
relative mRNA levels varied significantly. MS4A7 was expressed in most, 
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but not all of the cell lines tested. MS4A12 transcripts were not detected in 
these cell lines. Thus, most MS4A family members are likely to be 
expressed in hematopoietic tissues. 

ESTs encoding MS4A transcripts were isolated from a variety of 
5 different cDNA libraries. MS4A4A ESTs were from aorta, brain, breast, 
heart, kidney, lung, ovary, pancreas, placenta, prostate, stomach, testis, and 
uterine tissues. MS4A5 ESTs were only isolated from testis. MS4A6A ESTs 
were from aorta, brain, the central nervous system, colon, gall bladder, 
heart, kidney, lung, muscle, ovary, pancreas, placenta, prostate, skin, 

10 stomach, tonsil, uterus and embryonic tissues. MS4A7 ESTs were from 
lung, kidney, lymphocytes, mammary gland, placenta, spleen, testis, thymus, 
and uterine tissues. MS4A8B ESTs were from brain, lung, uterus and 
embryonic tissues. A single MS4A12 EST was isolated from colon. This 
demonstrates differential MS4A gene transcription among lymphoid and 

1 5 non-lymphoid tissues. 
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Since most of the MS4A genes are expressed by hematopoietic cells, 
MS4A4E, MS4A6E and MS4A10 transcription were assessed by RT-PCR 
amplification of cDNA from human hematopoietic cell lines and human 
tissues. Transcripts from eleven human hematopoietic cell lines were 
5 evaluated; one pre-B cell line (NALM-6), three B cell lines (BJAB, DAUDI, 
and SB), four T cell lines (HSB-2, HUT-78, JURKAT, and MOLT15), two 
myelomonocytic lines (HL60 and U937), and one erythroleukemia cell line 
(K562). In addition, transcripts from eight human tissues were evaluated; 
colon, ovary, peripheral blood leukocytes, prostate, small intestine, spleen, 

10 testes and thymus. However, MS4A4E, MS4A6E and MS4A10 transcripts 
were not detected in any of these cell lines or tissues. 

MS4A4E, MS4A6E, and MS4A10 sequences were also used to 
search the translated GenBank databases using the BLAST program 
(Altschul et al., 1997). Eleven EST sequences representing MS4A6E 

15 transcripts were found that represented nine cDNAs isolated from pooled 
fetal organ libraries (GenBank Accession Nos. AA382998, AA909515, 
AA917066, AI222355, AI279944, AI684553, AI699419, AI743473, 
AI806247), one cDNA from a pooled germ cell tumor library .(GenBank 
Accession No. AI968835), and one cDNA from a colon tumor (GenBank 

20 Accession No. AW951636). EST cDNAs encoding MS4A4E or MS4A10 
sequences were not identified. This suggests that MS4A4E, MS4A6E, and 
MS4A10 transcripts are rare among normal tissues or they are primarily 
expressed during oncogenesis or embryogenesis. 

MS4a gene expression by mouse tissues was assessed by Northern 

25 analysis and PGR amplification of cDNAs (Table 4). In most cases 
assessed, Northern analysis failed to detect specific MS4a transcripts in 
tissues that revealed transcript production by PCR amplification. These 
results suggest that MS4a transcripts are only produced by subpopulations 
of cells within each tissue such that transcript levels were often below the 

30 level of detection by Northern analysis. Nonetheless, MS4a4B t MS4a4C, 
and MS4a6B transcripts were found at high levels in thymus, spleen and 
peripheral lymph nodes, with less abundant levels in non-lymphoid tissues. 
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MS4a6C was only expressed by thymus, spleen, PLN and bone marrow. 
MS4a4C, MS4a6D and MS4a7 were expressed in all tissues examined. 
MS4a8B transcripts were expressed by spleen, peripheral lymph nodes, 
colon, liver, heart, lung and bone marrow. MS4a10 transcripts were found in 

5 thymus, kidney, colon, brain, and testis. In addition, CD20 {MS4a1), FceRip 
(MS4a2), and MS4a3 expression were primarily restricted to hematopoietic 
tissues. MS4a3, MS4a4B, MS4a4C, MS4a6B, MS4a6C, MS4a6D, MS4a7, 
MS4a8B, and MS4a10 were also expressed by various hematopoietic and 
lymphoblastoid cell lines. Therefore, most MS4a family members were 

1 0 expressed by hematopoietic cells. 

Table 4 

MS4a Gene Expression hy Mouse Tissues 3 
MS4a Thymus Spleen PLN BM Liver Kidney Heart Colon Lung Brain Test€ 



1 + +++ +++ + 

2 + + + +++ 



3 ++++++---- + 

4B +++ +++ +++ ++ + + + + + 

4C +++ +++ +++ ++ + + 



4D + + ++ 

6B +++ 



+ + + + + + + 
+ + ++++++- + 



+++ +++ ++ + - + + + . ++ 

6C + + + ++-- - _ - - - 

6D +++ +++ +++ ++ +++ +++ +++ +++ +++ +++ +++ 

j ++ ++ ++++ + + + ++ + + + 

8B . + + + + -+++ + -- 



10 + 



+ . + - + ++ 



G3PDH +++ +++ +++ +++ +++ +++ +++ +++ +++ +++ ++4 

a Gene transcription was assessed by PCR amplification of cDNA generated 
from mRNA isolated from tissue samples. Values represent the level of 
15 PCR product generated relative to the glyceraldehyde-3-phosphate 
dehydrogenase (G3PDH) control as described for Table 3. Peripheral lymph 
node (PLN) and bone marrow (BM). 
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Expression of MS4A family members was also assessed in mouse 
hematopoietic cell lines (Table 5). Nine of the twelve MS4A genes were 
expressed in pre-B cell lines and five of the MS4A genes were expressed in 
B cell lines. Six of the MS4A genes were expressed by T cell lines. These 
5 data suggest that B cells can express most members of the MS4A gene 
family, although the patterns of expression of each gene is distinct. 

Table 5 

MS4a Expression by Mouse Lymphoid Tissues and Cell Lines 3 
Tissues Pre B cell lines B cell lines T cell lines 



MS4a 


Spleen 


Thymus 


300.19 


38B9 


70Z 


A20 


AJ9 


BW514 


EL-14 


1 


+++ 


+ 








+++ 


+++ 






2 


+ 


+ 






+ 










3 


+ 


+ 




+ 












4B 


+++ 


+++ 














++ 


4C 


+++ 


+++ 




+ 


+ 


++ 






+ 


4D 


+ 


+ 
















6B 


+++ 


+++ 




+ 


+++ 


+ 




+++ 


+++ 


6C 


+ 






+ 


+ 








+ 


6D 


+++ 


+++ 




++ 


+++ 




+ 




+++ 


7 


++ 


++ 






++ 










8B 


+ 








++ 










10 




+ 






+ 


+ 


+ 


+ 




G3PDH 


+++ 


+++ 


+++ 


+++ 


+++ 


+++ 


+++ 


+++ 


+++ 



10 



a Gene transcription was assessed by PCR amplification of cDNA generated 
from mRNA isolated from each cell type. Values represent the level of PCR 
product generated relative to the glyceraldehyde-3-phosphate 
dehydrogenase (G3PDH) control in three separate PCR reactions: no 
15 specific PCR product detected; +, low levels of the appropriate band were 
detectable; ++ to +++, appropriate bands of increasing intensity were readily 
visualized in all samples examined. Identical results were obtained using 
two different primer pairs for cDNA amplification. 
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IM.B. Detection of MS4A Polymorphisms 

In another embodiment, genetic assays based on nucleic acid 
molecules of the present invention can be used to screen for genetic 
variants by a number of PCR-based techniques, including single-strand 

5 conformation polymorphism (SSCP) analysis (Orita, M., et al. (1989) Proc 
Natl Acad Sci USA 86(8):2766-2770), SSCP/heteroduplex analysis, enzyme 
mismatch cleavage, and direct sequence analysis of amplified exons (Kestila 
et al. (1998) Mol Cell 1 (4):575-582; Yuan et al. (1999) Hum Mutat 14(5):440- 
446). Automated methods can also be applied to large-scale 

10 characterization of single nucleotide polymorphisms (Brookes (1999) Gene 
234(2): 177-1 86; Wang et al. (1998) Science 280(5366): 1077-82). The 
present invention further provides assays to detect a mutation of a variant 
MS4A locus by methods such as allele-specific hybridization (Stoneking et 
al. (1991 ) Am J Hum Genet 48(2):370-82), or restriction analysis of amplified 

1 5 genomic DNA containing the specific mutation. 

IV. Recombinant Production of a MS4A P olypeptide 

The present invention also provides a method for recombinant 
production of a MS4A polypeptide, as described in Example 3. Preferably, 
20 the recombinant polypeptide comprises some or all of the amino acid 
sequences of any one of the even-numbered SEQ ID NOs:2-38. 

Recombinantly produced proteins are useful for a variety of purposes, 
including structural determination of a MS4A polypeptide, generation of an 
antibody that recognizes a MS4A polypeptide, and screening assays to 
25 identify a chemical compound or peptide that interacts with a MS4A 
polypeptide, described further herein below. 

V. Production of MS4A Antibodies 

In another aspect, the present invention provides a method of 
30 producing an antibody immunoreactive with a MS4A polypeptide, the 
method comprising recombinantly or synthetically producing a MS4A 
polypeptide, or portion thereof, to be used as an antigen. The MS4A 
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polypeptide is formulated so that it is can be used as an effective 
immunogen. An animal is immunized with the formulated MS4A 
polypeptide, generating an immune response in the animal. The immune 
response is characterized by the production of antibodies that can be 
5 collected from the blood serum of the animal. Optionally, cells producing a 
MS4A antibody can be fused with myeloma cells, whereby a monoclonal 
antibody can be selected. Exemplary methods for producing a monoclonal 
antibody that recognizes a MS4A protein are described in Example 4. 
Preferred embodiments of the method use a polypeptide set forth as any 

1 0 one of the even-numbered SEQ ID NOs:2-38. 

The present invention also encompasses antibodies and cell lines 
that produce monoclonal antibodies as described herein. 

The foregoing antibodies can be used in methods known in the art 
relating to the localization and activity of the MS4A polypeptide sequences of 

15 the invention, e.g., for cloning of MS4A nucleic acids, immunopurification of 
MS4A polypeptides, imaging MS4A polypeptides in a biological sample, 
measuring levels thereof in appropriate biological samples, and in diagnostic 
methods. 

20 VL Methods for Detecting a MS4A Polypeptide 

In another aspect of the invention, a method is provided for detecting 
a level of MS4A polypeptide using an antibody that specifically recognizes a 
MS4A polypeptide, or portion thereof. In a preferred embodiment, biological 
samples from an experimental subject and a control subject are obtained, 

25 and MS4A polypeptide is detected in each sample by immunochemical 
reaction with the MS4A antibody. More preferably, the antibody recognizes 
amino acids of any one of the even-numbered SEQ ID NOs:2-38, and is 
prepared according to a method of the present invention for producing such 
an antibody. 

30 In one embodiment, a MS4A antibody is used to screen a biological 

sample for the presence of a MS4A polypeptide. A biological sample to be 
screened can be a biological fluid such as extracellular or intracellular fluid, 
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or a cell or tissue extract or homogenate. A biological sample can also be 
an isolated cell (e.g., in culture) or a collection of cells such as in a tissue 
sample or histology sample. A tissue sample can be suspended in a liquid 
medium or fixed onto a solid support such as a microscope slide. In 
5 accordance with a screening assay method, a biological sample is exposed 
to an antibody immunoreactive with a MS4A polypeptide whose presence is 
being assayed, and the formation of antibody-polypeptide complexes is 
detected. Techniques for detecting such antibody-antigen conjugates or 
complexes are well known in the art and include but are not limited to 
10 centrifugation, affinity chromatography and the like, and binding of a labeled 
secondary antibody to the antibody-candidate receptor complex. 

In one embodiment, an antibody that specifically recognizes a MS4A 
polypeptide can be used to assess the tissue- or cell-distribution of MS4A 
protein, for example, to evaluate CD20 expression during B lymphocyte 
15 development (Figure 9). CD20 expression in B220+ lymphocytes from 
lymphoid tissues of wild type mice was examined by two-color 
immunofluorescence. In bone marrow, three types of B220* cells were 
detected. The vast majority of B220 hi lymphocytes expressed CD20. 
However, the majority of B220 10 lymphocytes were CD20-negative. Thus, 
20 CD20 was predominantly expressed by mature B cells. 

CD19 expression is restricted to normal and neoplastic B cells and 
follicular dendritic cells. CD19 is expressed early by B progenitor cells in the 
bone marrow, presumably at the late pro-B or early pre-B cell stages around 
the time of immunoglobulin heavy chain rearrangement (Anderson et al. 
25 (1984) Blood 63:1424). Expression persists during all stages of B cell 
maturation and is lost upon terminal differentiation to plasma cells. 

Double staining of CD20 with IgM and CD19 antibodies showed that 
some of the CD19 10 and IgM 10 cells were CD20 negative in the bone marrow. 
A few IgM- cells also expressed low levels of CD20 in the bone marrow. 
30 This data suggested that the CD20 expression was later than the CD19 
expression but before or around the time of IgM expression during B cell 
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development in the bone marrow since these cells were gated on 
lymphocytes not dendritic cells. 

jhe level of CD20 expression observed on mature B220 hi B cells in 
bone marrow was maintained by B cells from peripheral lymphoid tissues. 
5 The vast majority of B220 + B cells in the spleen, blood, peripheral lymph 
nodes, and peritoneal cavity expressed CD20. Therefore, like human CD20, 
mouse CD20 was also exclusively expressed on B cells from the immature B 
cell stage to mature B cells. 

10 VIL Identification of MS4A Modulators 

VILA. Screening for Small Molecule Liaands that Interact with a 

MS4A Polypeptide 
The present invention further discloses a method for identifying a 
compound that modulates MS4A function. According to the method, a 
15 MS4A polypeptide is exposed to a plurality of compounds, and binding of a 
compound to the isolated MS4A polypeptide is assayed. A compound is 
selected that demonstrates specific binding to the isolated MS4A 
polypeptide. Preferably, the MS4A polypeptide used in the binding assay of 
the method includes some or all amino acids of any one of the even- 
20 numbered SEQ ID NOs:2-38. 

Several techniques can be used to detect interactions between a 
protein and a chemical ligand without employing an in vivo ligand. 
Representative methods include, but are not limited to, Fluorescence 
Correlation Spectroscopy, Surface-Enhanced Laser Desorption/lonization 
25 Time-Of-flight Spectroscopy, and Biacore technology, as described in 
Example 5. These methods are amenable to automated, high-throughput 
screening. 

Candidate regulators include but are not limited to proteins, peptides, 
and chemical compounds. Structural analysis of these selectants can 
30 provide information about ligand-target molecule interactions that enable the 
development of pharmaceuticals based on these lead structures. 
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Similarly, the knowledge of the structure a native MS4A polypeptide 
provides an approach for rational drug design. The structure of a MS4A 
polypeptide can be determined by X-ray crystallography or by computational 
algorithms that generate three-dimensional representations. See Huang et 
5 al. (2000) Pac Symp Biocomput 230-41; Saqi et al. (1999) Bioinformatics 
15:521-522. Computer models can further predict binding of a protein 
structure to various substrate molecules, that can be synthesized and tested. 
Additional drug design techniques are described in U.S. Patent Nos. 

5,834,228 and 5,872,011. 
10 vh.b. Methods for Identifying Modulators of MS4A Gene Expression 

The assembly and annotation of genomic sequences comprising 
MS4A genes in the region of human chromosome 11q12-13.1, disclosed 
herein for the first time, identify MS4A gene regulatory regions. Preferably, 
MS4A gene regulatory regions comprise sequences upstream of the initial 
15 coding region of each MS4A gene as disclosed in SEQ ID NOs:73-81. An 
expression cassette comprising a MS4A promoter region can be employed 
in assays for the identification of modulators of MS4A expression. Thus the 
present invention also provides a method for identifying a substance that 
regulates MS4A gene expression using a chimeric gene that includes an 
20 isolated MS4A gene promoter region operably linked to a reporter gene. 
According to this method, a gene expression system is established that 
includes the chimeric gene and components required for gene transcription 
and translation so that reporter gene expression is assayable. To select a 
substance that regulates MS4A gene expression, the method further 
25 provides the steps of using the gene expression system to determine a 
baseline level of reporter gene expression in the absence of a candidate 
regulator; providing one or more candidate regulators to the gene expression 
system; and assaying a level of reporter gene expression in the presence of 
a candidate regulator. A candidate regulator is selected whose presence 
30 results in an altered level of reporter gene expression when compared to the 
baseline level. 

Several molecular cloning strategies can be used to identify 
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substances that specifically bind a MS4A gene cis-regulatory element. In 
one embodiment, a cDNA library in an expression vector, such as the 
Iambda-gt1 1 vector, can be screened for cDNA clones that encode a MS4A 
gene regulatory element DNA-binding activity by probing the library with a 

5 labeled MS4A DNA fragment, or synthetic oligonucleotide (Singh et al. 
(1989) Biotechniques 7:252-261). Preferably, the nucleotide sequence 
selected as a probe has already been demonstrated as a protein binding site 
using a protein-DNA binding assay, as described in Example 9. 

In another embodiment, transcriptional regulatory proteins are 

10 identified using the yeast one-hybrid system (Luo et al. (1996) Biotechniques 
20(4):564-568; Vidal et al. (1996) Proc Natl Acad Sci USA 93(1 9):1 0315- 
10320; Li & Herskowitz (1993) Science 262:1870-1874). In this case, a cis- 
regulatory element of a MS4A gene is operably fused as an upstream 
activating sequence (UAS) to one, or typically more, yeast reporter genes 

15 such as the lacZ gene, the URA3 gene, the LEU2 gene, the HIS3 gene, or 
the LYS2 gene, and the reporter gene fusion construct(s) is inserted into an 
appropriate yeast host strain. It is expected that the reporter genes are not 
transcriptionally active in the engineered yeast host strain, for lack of a 
transcriptional activator protein to bind the UAS derived from the MS4A gene 

20 promoter region. The engineered yeast host strain is transformed with a 
library of cDNAs inserted in a yeast activation domain fusion protein 
expression vector, e.g. pGAD, where the coding regions of the cDNA inserts 
are fused to a functional yeast activation domain coding segment, such as 
those derived from the GAL4 or VP16 activators. Transformed yeast cells 

25 that acquire a cDNA encoding a protein that binds a cis-regulatory element 
of a MS4A gene can be identified based on the concerted activation the 
reporter genes, either by genetic selection for prototrophy (e.g. LEU2, HIS3, 
or LYS2 reporters) or by screening with chromogenic substrates (e.g., a lacZ 
reporter) by methods known in the art. 

30 The present invention also provides an in vivo assay for discovery of 

modulators of MS4A gene expression. In this case, a transgenic non-human 
animal is made such that a transgene comprising a MS4A gene promoter 
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and a reporter gene is expressed and a level of reporter gene expression is 
assayable. Such transgenic animals can be used for the identification of 
compounds that are effective in modulating MS4A gene expression. 
In vitro or in vivo screening approaches can also survey more than one 
5 modulatable transcriptional regulatory sequence simultaneously. 



VIII. Animal Models 

The present invention further pertains to an animal model of disorders 
associated with a MS4A nucleic acid or polypeptide, including but not limited 
10 to atopic disorders, abnormal target cell development, function, and Ca ++ 
responses. Such a model can be prepared by several methods. Using a 
transgenic approach, knock-out, knock-in, or knock-down mutation of the 
MS4A gene can suppress MS4A function. The present invention also 
teaches that an animal model of a MS4A-related disorder can be prepared 
15 by immunizing an animal with a MS4A polypeptide. The resulting immune 
response in the animal comprises a production of antibodies that specifically 
bind a MS4A polypeptide, thereby disrupting its biological activity. A method 
is also provided for generating an animal model of a MS4A-related disorder 
by administering to an animal a compound that disrupts MS4A expression or 
20 function. Such a compound is discovered by methods disclosed herein. 
VIH.A. feneration o* r.n?0-Deficient Mice 

CD20-deficient mice were generated by targeted disruption of the 
CD20 gene in embryonic stem (ES) cells using homologous recombination, 
as described in Example 6. A targeting vector was generated that replaces 

25 exons encoding part of the second extracellular loop, the 4 th transmembrane 
domain, and the large carboxyl-terminal cytoplasmic domain of CD20 with a 
neomycin resistant gene (Figure 10A-D). Appropriate gene targeting 
generates an aberrant CD20 protein truncated at amino acid position 157 
and fused with an 88 amino acid protein encoded by the Neo r gene promoter 

30 sequence. 

After DNA transfections, 6 of 1 15 Neo-resistant ES cell clones carried 
the targeted allele as determined by Southern blot analysis of EcoR V 
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digested genomic DNA using a 1.5 kb DNA probe (Figure 10D). Appropriate 
targeting was further verified in two clones by Southern analysis of ES cell 
DNA digested with BamH I (>12 kb fragment was reduced to a 6.5 kb band 
in targeted cells), Kpn I (7.2 kb became 5.5 kb), and Ssp I (5.6 kb became 
5 7.0 kb) using the same probe. Cells of one ES cell clone were injected into 
blastocysts that were transferred into foster mothers. Highly chimeric male 
offspring (80-100% according to coat color) bred with C57BL/6 (B6) females 
transmitted the mutation to their progeny (Figure 10E). Mice homozygous 
for disruption of the CD20 gene were obtained at the expected Mendelian 

1 0 frequency by crossing heterozygous offspring. 

Appropriate targeting of the CD20 gene was further verified by PCR 
analysis of genomic DNA from homozygous offspring (Figure 10F). Wild 
type CD20 mRNA was absent in CD20"'' mice as confirmed by PCR 
amplification of cDNA generated from splenocytes of CD20"'" mice (Figure 

15 10G). CD20-deficient mice (CD20"'") thrived and reproduced as well as their 
wild type littermates and did not present any obvious anatomical or 
morphological abnormalities during the first year of life. 

Absence of cell surface CD20 protein expression in CD20" 7 " mice was 
further verified by staining B220 + splenocytes with murine anti-CD20 

20 monoclonal antibodies. Hybridomas producing these antibodies were 
generated using splenocytes from CD20" 7 * mice that were immunized with 
CD20-GFP cDNA-transfected 300.19 cells. Ten hybridomas secreted 
antibodies reactive with 300.19 (Figure 10H) and CHO (Figure 101) cells 
transfected with CD20-GFP cDNA, but not with untransfected CHO or 

25 300.19 cells (Table 6). These antibodies also reacted with CD20 epitopes 
expressed on the cell surface of B220 + splenocytes from wild type mice, but 
not with splenocytes from CD20"'" mice (Figure 10J). Therefore, targeted 
mutation of the CD20 gene abrogated cell surface CD20 protein expression. 
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^r~^TNams te<wT CD20-CHO CHO CD20-300.19 300.19 Spiee, 



Ab Name 


Clone Name 


Isotype 


MB20-1 


MCD20-5 


lgG1,K 


MB20-2 


MCD20-61 


igGl.K 


MB20-3 


MCD20-86 


lgG3, K 


MB20-6 


MCD20-223 


lgG2a, K 


MB20-7 


MCD20-243 


lgG2b, K 


MB20-8 


MCD20-270 


lgG2b, K 


MB20-10 


MCD20-388 


lgG2b, K 


MB20-11 


MCD20-392 


lgG2a, K 


MB20-13 


MCD20-624 


lgG3, K 


MB20-14 


MCD20-642 


IgGI.K 



++ 



++ 



++ 



a Values represent reactivity of the monoclonal antibody with adherent 
5 monolayers of CHO cells either transfected or untransfected with CD20-GFP 
cDNA as assessed by a cell-based ELISA. The monoclonal antibodies did 
not react with GFP cDNA-transfected CHO cells. 

b Cell surface reactivity of the monoclonal antibody with single cell 
suspensions of 300.19 cells either transfected or untransfected with CD20- 
10 GFP cDNA or spleen cells from wild type mice. Values represent relative 
indirect immunofluorescence staining intensity as assessed by flow 
cytometry and shown in figure 10H-J. 
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Vlll.B. B Cell Development and Function in CD20" 7 ' Mice 
CD20''* mice did not show an obvious propensity for infections during 
their first year of life. They had normal frequencies of IgM" B220 10 pro/pre-B 
cells, lgM + B220 !o immature B cells and lgM + B220 hi mature B cells in the 
5 bone marrow (Figure 11, Table 7). Overall, the number of circulating and 
spleen lgM + B220 + B cells found in CD20"/- mice was increased compared 
with wild type littermates (Table 7). However, an immunohistochemical 
analysis of spleen tissue sections revealed a normal architecture and 
organization of the spleen. In the bone marrow, overall IgM expression was 
10 decreased on immature B cells, yet increased on mature B cells when 
compared with IgM levels expressed by comparable cells in wild type 
littermates. However, overall IgM expression by mature B220 hi B cells in 
the blood, spleen and lymph nodes was slightly lower in CD20"/- mice 
(Figure 11B-D). There were no obvious differences in the size (light scatter 

15 properties) of CD2(W- B cells isolated from bone marrow, blood, lymph 
nodes or spleen when compared with B cells from wild type littermates. 
These data therefore suggest that CD20 plays a functional role in the 
development and tissue localization of B cells. 



20 
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Table 7 

Fre quencies and Numbers of B Lymphocytes in CP20 '" Mice 

Phenotype Wild Type CD20'" Wild Type CD20 



% of B Lymphocytes B cell numbers (x1 0" ; 



Rnnp Marrow 


B220 lo lgM" 


36 + 2 


34 ±3 








B220'°lgM + 


19 ±2 


13 ± 2 








B220 w lgM + 


14±2 


16±4 






Blood d 


B220 + lgM + 


61 ±2 


60 ±3 


3.6 ±0.5 


3.9 ±< 


Spleen 


B220 + lgM + 


51+6 


53 ±5 


58 ±12 


76 ± 


Lymph Nodes 6 


B220 + lgM + 


26 ±6 


19±2 


1.2 ±0.3 


0.9 ± 


Peritoneum 


B220 + lgM + 


70 ±4 


69 ±5 


2.4 ± 0.3 


3.1 ± 




B220 to CD5* 


44 ±4 


15 ±5** 


1.5 ±0.2 


0.7 ±( 




B220 hi CD5- 


28 ±2 


59 + 3** 


1.0 ±0.1 


2.7 ±( 



a Values represent mean (± SEM) results obtained from seven 2-month-old of 
5 wild type controls and 1 0 CD20"'- mice. Numbers represent the percentage 

of lymphocytes (based on side and forward light scatter properties) 

expressing the indicated cell surface markers. 

b B cell numbers were calculated based on the total number of cells 

harvested from the indicated tissues. 
1 0 d The values indicate the number of cells/ml. 

'Values represent results from peripheral lymph nodes pairs. 

♦The percentage or number was significantly different than in wild-type, p < 
0.05; **p<0.01. 
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Within the peritoneal cavity, the number of lgM + B220 + B cells in 
CD20 _/ - mice was similar to that of wild-type littermates (Table 7, Figure 
11E). However, there was a 4-fold decrease in the number of CD5 + B220 l0 
B1a cells, with a compensatory increase in the number of CD5~ B220 ni B2 
5 cells. Therefore, CD20-deficiency predominantly affected the development 
or clonal expansion of the B1 subpopulation of B cells within the peritoneal 
cavity. Exemplary methods for quantitating B cell populations are described 
in Example 7. 

VHI.C. Reduced \Ca ++ \i Responses in CD20 B Cells 

10 The loss of CD20 significantly altered early B cell signaling 

responses, measured as described in Example 8. Splenic B220 + B cells 
from CD20 7 " mice generated substantially reduced [Ca ++ ]i responses 
following surface IgM ligation when compared with wild type B cells. 
Decreased [Ca ++ ]i responses in CD20''* B cells were observed in response 

15 to both optimal (40 ug/ml, Figure 12A) and suboptimal concentrations (5 
ug/ml) of anti-IgM antibodies. Although the kinetics of [Ca ++ ]i responses in 
CD20"'" B cells was not altered, the magnitude of both the immediate [Ca ++ ]i 
increase and the sustained increase observed at later time points were 
inhibited by loss of CD20 expression. More dramatic decreases in [Ca ++ ]i 

20 responses (>50%) by CD20''' B cells were observed in response to CD19 
ligation with optimal concentrations (40 pg/ml) of antibody (Figure 12A). 
Reduced [Ca ++ ]i responses following CD19 ligation on CD20"'" B cells were 
likely to result from differences in signaling capacity since Thapsigargin- 
induced (Figure 12A) and lonomycin-induced [Ca ++ ]i responses were higher 

25 in CD20 - '- B cells than in wild type B cells. In addition, CD19 expression 
levels were not significantly different between CD20"'" and wild type B cells 
(Figure 12A). 

Chelation of extracellular calcium with EGTA reduced the kinetics and 
magnitude of the immediate [Ca ++ ]i increase observed following IgM 
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crosslinking (Figure 12A). However, the [Ca ++ ]i increase observed at later 
time points was not substantially inhibited by EGTA treatment. Similar 
results were observed in CD20"'' B cells. By contrast, chelation of 
extracellular calcium with EGTA almost eliminated the [Ca ++ ]i response 
5 observed following CD19 crosslinking (Figure 12A). This suggests that 
transmembrane Ca ++ flux contributes substantially to the [Ca ++ ]i responses 
observed following CD19 crosslinking. That CD20-deficiency had a 
substantial effect on CD19-induced [Ca ++ ]i responses suggests that CD20 
can contribute significantly to transmembrane Ca ++ flux. 
10 The consequences of CD20 loss on transmembrane signal 

transduction was further evaluated by assessing total cellular protein 
tyrosine phosphorylation in purified B cells following IgM ligation. Although 
some variation was observed between B cells from individual mice in 
individual experiments, overall levels of tyrosine phosphorylation in resting 
15 splenic B cells were higher in CD20" / - B cells than in wild type mice (Figure 
12C). In addition, protein phosphorylation in B cells from CD20"'" mice 
increased more significantly after B cell antigen receptor (BCR) ligation than 
in wild type B cells. Thus, while CD20 expression can influence BCR- 
induced tyrosine phosphorylation, decreased [Ca ++ ]i responses in CD20* 7 " B 
20 cells are unlikely to result from significant abnormalities in transmembrane 
signaling through the BCR. 

IX. Therapeutic Applications 

Another aspect of the present invention is a therapeutic method 
25 comprising administering to a subject a substance that modulates MS4A 
biological activity. Therapeutic substances include but are not limited to 
chemical compounds, antibodies, and gene therapy vectors. Substances 
that are discovered by the methods disclosed herein are useful for 
therapeutic applications related to disorders of MS4A function. 
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In one embodiment, the present invention provides a method for 
disrupting MS4A function by immunizing a subject with an effective dose of 
the disclosed MS4A polypeptide. The immune system of the subject 
produces an antibody that specifically recognizes the MS4A polypeptide, 
5 and binding of the antibody to the MS4A polypeptide abolishes MS4A 
function. 

In another embodiment, the present invention provides MS4A nucleic 
acid sequences and gene therapy methods for modulating MS4A activity in a 
target cell. The gene therapy vector can encode a MS4A or sequences 
10 encoding a nucleic acid molecule, peptide, or protein that interacts with a 
MS4A protein. 

Vehicles for delivery of a gene therapy vector include but are not 
limited to a liposome, a cell, and a virus. Preferably, a cell is transformed or 
transfected with the DNA molecule or is derived from such a transformed or 

15 transfected cell. Alternatively, the vehicle is a virus, including a retroviral 
vector, adenoviral vector or vaccinia virus whose genome has been 
manipulated in alternative ways so as to render the virus non-pathogenic. 
Methods for creating such a viral mutation are detailed in U.S. Patent No. 
4,769,331. Exemplary gene therapy methods are also described in U.S. 

20 Patent Nos. 5,279,833; 5,286,634; 5,399,346; 5,646,008; 5,651,964; 
5,641,484; and 5,643,567. 

The therapeutic methods of the present invention can be applied in 
the treatment of a variety of conditions, including in the treatment of non- 
Hodgkin's lymphoma and in the treatment of atopic disorders or other 

25 allergenic diseases. Application of the present inventive therapeutic 
methods are evidenced by the current U.S. Food and Drug Administration 
approved use of antibodies against CD20 in the treatment of non-Hodgkin's 
lymphoma. Additionally, the therapeutic methods of the present invention 
are illustrated in view of the recognition in the art that genetic variations at 

30 chromosome 11Q12-13 can also play a role in the pathogenesis of atopic 
disorders and other allergenic diseases. Indeed, it has been recognized that 
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FceRIp contributes to such diseases, and thus the MS4A genes identified in 
accordance with the present invention are envisioned also to contribute to 
allergenic disease. Therefore the present therapeutic methods, which 
pertain to the modulation of the biological activity of an MS4A polypeptide of 
5 the present invention have application with respect to the treatment of such 
disorders. 

X. Summary 

The invention comprises 19 new genes that are members of a class 
10 of genes encoding MS4A proteins. Three members have been described, 
CD20, FceRIp, and HTm4. A gene family has been defined based on a 
shared chromosomal location, conservation of protein size and structure, 
gene structure conservation, and similar expression in hematopoietic cells. 
MS4A proteins function as oligomeric cell surface complexes, and complex 
15 assembly using diverse MS4A members is implicated as a mechanism for 
regulating complex function. 

Two members of this class, CD20 and FceRIp, have been described 
functionally, and in each case an important function has been delineated. 
CD20 is required for cell cycle progression and signal transduction in B 
20 lymphocytes. CD20 also regulates Ca ++ conductance, possibly as a cation 
channel subunit. Of clinical relevance, antibodies that recognize CD20 are 
effective in treating non-Hodgkin's lymphoma. FceRIp mediates interactions 
with IgE-bound antigens that lead to degranulation of mast cells, and 
variation of the FceRIp locus is implicated in allergenic disease. 

25 The utility of the MS4A genes is based in part on overlapping or 

shared functions with known MS4A members. In one case, new MS4A 
genes have important potential as part of a CD20 complex. The structural 
description of CD20 complexes suggests that one or more CD20-related 
proteins constitute the functional complex. Thus, new MS4A proteins can 

30 define antigens useful for lymphoma treatment. In another case, MS4A 
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genes are implicated in IgE responses. Atopic disorders (allergy, asthma, 
eczema, allergic rhinitis) are dysfunctional IgE responses and are associated 
with a locus on human chromosome 11q containing most members of the 
MS4A gene family. FceRIp is one relevant factor, and recent work supports 
5 that FceRIp as well as other genetic elements in the region contribute to the 
disease. Thus, as disclosed herein, the present MS4A sequences also have 
utility in the characterization, diagnosis, and potential treatment of atopy 
linked to the chromosomal location wherein MS4A genes are located. 



10 Examples 

The following Examples have been included to illustrate modes of the 
invention. Certain aspects of the following Examples are described in terms 
of techniques and procedures found or contemplated by the present co- 
inventors to work well in the practice of the invention. These Examples 
15 illustrate standard laboratory practices of the co-inventors. In light of the 
present disclosure and the general level of skill in the art, those of skill will 
appreciate that the following Examples are intended to be exemplary only 
and that numerous changes, modifications, and alterations can be employed 
without departing from the scope of the invention. 

20 Example 1 

Database Searches and cDNA Isolation 

Three hundred and thirty seven nucleotide sequences obtained from 
the translated GenBank database of expressed sequence tags (ESTs) were 
assembled into sixty-two subgroups of contiguous linear segments based on 

25 their overlapping sequences and potential for encoding proteins homologous 
with CD20. Based on these subgroups, EST cDNAs (Figure 1) were 
obtained from the ATCC and sequenced. Based on the complete 
sequences of twenty-one near full-length EST cDNAs, eleven novel genes 
were defined in human and mouse that unified multiple EST subgroups. 

30 Near full-length EST clones representing these genes are shown in Figure 1. 
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These eleven genes and five additional genes were also identified by PCR 
amplification of transcripts using subgroup-specific primers or primers based 
on EST sequences. The specific details of how cDNAs representing the five 
genes that were not identified by EST cDNA clones are indicated below. In 
5 all cases, ESTs and cDNAs encoding the predicted coding regions of each 
putative unique gene were sequenced in both directions and at least two 
independent ESTs and/or cDNAs representing near full-length gene 
products were sequenced. Thereby, there was independent confirmation of 
accuracy for all of the sequences reported. 
10 Based on EST subgroup sequences, cDNAs encoding mouse 

MS4a4B and MS4a4C were isolated by PCR amplification of C57BL/6 
mouse spleen cDNA using both Taq and Pfu DNA polymerase. Primers for 
MS4a4B (SEQ ID NOs:63-64) amplified an 879 bp fragment. Primers for 
MS4a4C (SEQ ID NOs:65-66) amplified a 794 bp fragment. EST sequences 
15 for MS4a4D only encoded the 3' end of the predicted protein. Since 
MS4a4D sequences were closely related to MS4a4B and MS4a4C 
sequences, a sense 5' primer (SEQ ID NO:67) based on consensus 
MS4a4B and MS4a4C sequences and a MS4a4D-specific antisense primer 
(SEQ ID NO:68) were used to amplify a 773 bp fragment from cDNA of 
20 C57BL/6 mouse lung. 

MS4a6C was initially identified based on one unique EST sequence 
(AA028258) encoding a mouse protein homologous with the C-terminal end 
of MS4a6B. MS4a6C cDNAs were isolated by PCR amplification of 
C57BL/6 mouse bone marrow cDNA using Taq polymerase. A primer based 
25 on identical sequences at the 5' end of the MS4a6B and MS4a6D cDNAs 
(SEQ ID NO:69) was used in combination with an antisense primer specific 
for the unique EST sequence (SEQ ID NO:70) to amplify a 787 bp fragment. 
Sequences from multiple independent PCR-amplified cDNAs were identical. 
Subsequently, the PCR-generated 5' end of the near full-length MS4a6C 
30 cDNA was found to be identical to an orphan EST subgroup sequence that 
had not been linked with defined 3' sequences. Thereby, the EST subgroup 
sequences verified that the PCR-amplified 5' end of the MS4a6C cDNAs 
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was appropriate. In addition, the overall MS4a6C sequence was similar to 
the sequence of MS4a6B cDNAs without interruption. Thus, the MS4a6C 
cDNA united sequences identical to those found in two non-overlapping 
CD20-homologous EST subgroups. cDNAs encoding a 473 bp fragment of 
5 mouse MS4a3 were amplified from cDNA of C57BL/6 bone marrow as 
described above. Primers (SEQ ID NOs:71-72) were obtained based on a 
single thymic cDNA EST sequence (GenBank AA940479) where the 
corresponding cDNA was not available. 

Human MS4A and mouse MS4a cDNA sequences (MS4A1 to 
10 MS4A12) (disclosed herein) were used to search the htgs GenBank human 
genomic database of unfinished human genomic sequences 
(http://www.ncbi.nlm.nih.gov/blast/) using the BLAST program. Seventeen 
phase 1 or phase 2 human genomic DNA sequences encoding potential 
MS4A genes were assembled into groups of contiguous linear segments 
15 based on their overlapping sequences. Three EST clones corresponding to 
partial MS4A6E transcripts were obtained from the ATCC and sequenced 
completely on both DNA stands. 

All PCR-amplified cDNAs were subcloned and sequenced entirely in 
both directions. Complete sequencing of at least two distinct PCR- 

20 generated cDNAs from both Taq and Pfu enzyme was performed in most 
cases. Differences between cDNA sequences were only noted when 
multiple cDNA clones generated by both Taq and Pfu polymerases revealed 
identical differences. In some cases, cDNAs or EST sequences contained 
potential intron|exon splice sites that delimited structural domains and 

25 aligned with the known intron|exon splice sites of CD20 (Tedder et al. 
(1989b) J Immunol 142:2560-2568). In these cases, potential introns were 
flanked by consensus splice donor and/or splice acceptor sequences (Aebi 
& Weissmann (1987) Trends Genet 3:102-107) or were likely to represent 
splice variants where exons were deleted. 
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Example 2 

RNA Isolation and Reverse Transcription-PCR 

Reverse transcription-PCR amplification (RT-PCR) was as described 
previously (Zhou & Tedder, 1995) with minor modifications. Total RNA was 
5 extracted from 1-2 x 10 7 hematopoietic cell lines using a RNeasy Mini Kit 
(Qiagen, Inc., Chatsworth, California) according to the manufacturer's 
instructions. Human hematopoietic cell lines included one pre-B cell line 
(NALM-6), three B cell lines (BJAB, DAUDI, and SB), four T cell lines (HSB- 
2, HUT-78, JURKAT, and MOLT15), two myelomonocytic lines (HL60 and 

10 U937), and one erythroleukemia cell line (K562). RNA concentrations were 
determined by UV absorbance. Ten jxg of total RNA was reverse 
transcribed. In some cases, cDNA from any of 8 different human tissues 
(colon, ovary, blood mononuclear cells, prostate, small intestine, spleen, 
testes, and thymus; from CLONETECH Laboratories, Inc., Palo Alto, 

15 California) was analyzed. RT-PCR amplification was performed using gene- 
specific primers identical with protein coding regions of the predicted MS4A 
genes during 35 cycles (94°C for 1 min, 55°C for 1.5 min, 72°C for 1.5 min, 
followed by extension at 72°C for 5 min). Following amplification, the PCR 
products were separated on 1% agarose-ethidium bromide gels and 

20 photographed. G3PDH, a housekeeping gene, was also amplified to control 
for sample to sample variation. RNA amplified without reverse transcription 
was used as a negative control, and was negative in all cases. 

Example 3 

Recombinant Production of MS4A Protein 

25 For recombinant production of a protein of the invention in a host 

organism, a nucleotide sequence encoding the protein is inserted into an 
expression cassette designed for the chosen host and introduced into the 
host where it is recombinantly produced. The choice of the specific 
regulatory sequences such as promoter, signal sequence, 5' and 3' 

30 untranslated sequence, and enhancer appropriate for the chosen host is 
within the level of ordinary skill in the art. The resultant molecule, containing 
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the individual elements linking in the proper reading frame, is inserted into a 
vector capable of being transformed into the host cell. Suitable expression 
vectors and methods for recombinant production of proteins are well known 
for host organisms such as E. coli t yeast, and insect cells (see, e.g., 
5 Lucknow & Summers (1988) Bio/Technol 6:47). Additional suitable 
expression vectors are baculovirus expression vectors, e.g., those derived 
from the genome of Autographies californica nuclear polyhedrosis virus 
(AcMNPV). 

Recombinantly produced proteins are isolated and purified using a 
10 variety of standard techniques. The actual techniques used varies 
depending upon the host organism used, whether the protein is designed for 
secretion, and other such factors. Such techniques are well known to the 
skilled artisan. See Ausubel et at. (1994). 

Example 4 

15 Mouse Anti-Mouse CD20 Monoclonal Antibody Production 

Hybridomas producing CD20-specific mouse monoclonal antibodies 
were generated by the fusion of NS-1 myeloma cells with spleen cells from a 
CD20~ / " mouse immunized with a cell line expressing a mouse CD20-GFP 
fusion protein. The CD20-GFP fusion protein was generated by subcloning 

20 a fragment of the pmB1-1 cDNA (from 1 59 to 1 050 bp of SEQ ID NO:39) into 
the PEGFP-N1 vector (Clonetech Laboratories Inc., Palo Alto, California) to 
generate an open reading frame encoding the entire CD20 protein with GFP 
fused to the carboxyl-terminal end. The resulting plasmid was linearized 
with ApaL I and used to transfect 300.19 cells, a mouse pre-B cell line, and 

25 Chinese Hamster Ovary (CHO) cells. Transfection was by Lipofectamine 
following the manufacturer's instructions (Clonetech Laboratories, Inc.). 
Transfected cells were selected using GENETICIN™ (1 mg/ml, GIBCOBRL) 
in RPMI 1640 media (Sigma) for 300.19 cells or H-12 nutrient mixture 
(GIBCOBRL) for CHO cells. Both media were supplemented with 10% FCS, 

30 L-glutamine, streptomycin and penicillin. Transfected cells expressing high 
levels of CD20-GFP were isolated by fluorescence-based cell sorting. 
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Example 5 

In vitro Binding Assays 

Recombinant protein can be obtained, for example, according to the 
approach described in Example 4 herein above. The protein is immobilized 
5 on chips appropriate for ligand binding assays. The protein immobilized on 
the chip is exposed to sample compound in solution according to methods 
well known in the art. While the sample compound is in contact with the 
immobilized protein, measurements capable of detecting protein-ligand 
interactions are conducted. Measurement techniques include, but are not 
10 limited to, SEDLI, Biacore, and FCS, as described above. Compounds 
found to bind the protein are readily discovered in this approach and are 
subjected to further characterization. 

Example 6 

Generation of CD20-Deficient Mice 

15 DNA encoding the CD20 gene was isolated from a phage library 

prepared from 129/Sv strain mouse DNA (Figure 10A), mapped with 
restriction endonucleases, and sequenced to identify intron|exon boundaries 
(Figure 10B). The targeting vector was constructed using a pBluescript SK 
(Stratagene, La Jolla, California)-based targeting vector (p594, provided by 

20 Dr. David Milstone, Brigham and Women's Hospital, Boston, 
Massachusetts). A DNA fragment starting at the Pst I site in CD20 exon 5 
through the EcoR V site in exon 6 (-1.8 kb) was isolated and blunt end 
ligated into the targeting vector downstream of the pMC1-HSV thymidine 
kinase gene and upstream of the neomycin resistance marker obtained from 

25 pGK-neo poly A (Stratagene) that contained the PGK promoter and poly A 
signal sequence. An -10 kb DNA fragment beginning at the Kpn I site 
downstream of exon 8 was also isolated and inserted into the targeting 
vector downstream of the neomycin resistant gene. The plasmid was 
linearized using a unique Sal I restriction site proximal to the 3' end of the 

30 CD20 gene insert and used to transfect ES cells. 
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ES cells were transfected with linearized plasmid DNA and selected 
for G418 resistance as described (Keller and Smithies (1989) Proc Natl 
Acad Sci USA 886:8932). Genomic DNA from individual selected clones 
was digested with EcoR V and used for Southern blot analysis along with a 
5 radiolabeled -1.5 kb DNA probe that was external to the targeting vector 
(Figure 10D). A 4.6 kb genomic DNA fragment hybridized with the probe in 
wild type ES cells or a 6.3 kb fragment in appropriately targeted ES cells (Fig 
1E). Genomic DNA generated by BamH I, Ssc I or Kpn I digestion was also 
analyzed for appropriate targeting. The Southern blot pattern obtained in all. 

10 cases was consistent with the appropriate predicted mutation indicating that 
detrimental recombinations did not occur in the vicinity of the desired 
homologous recombination. Cells from appropriately targeted ES cell clones 
were injected into 3.5 day old C57BL/6 blastocysts that were transferred into 
foster mothers. Offspring carrying the mutant CD20 allele were identified by 

15 Southern blot analysis of DNA obtained from tail biopsies. 

High chimeric males (80-100% according to color) were bred with 
C57BL/6 (B6) females to generate heterozygous offspring with germline 
gene transmission, which were crossed to generate the homozygous CD20''" 
and wild type littermates used for this study. In some cases, B6/129F1J 

20 (Jackson Laboratory) were used as controls. Results obtained using wild 
type littermates of CD20*'~ mice were similar and were therefore pooled. All 
mice were between 2-3 months of age when used for this study. Mice were 
housed in a specific pathogen-free barrier-facility. All studies and 
procedures were approved by the Animal Care and Use Committee of Duke 

25 University. 

Example 7 

Flow Cytometric Analysis of Lymphocyte Subsets 

Single cell suspensions of lymphocytes from the spleen, bone 
marrow, peripheral lymph nodes, and peritoneal cavity were isolated from 
30 CD20* 7 " and wild type mice and counted using a hemocytometer prior to two- 
color immunofluorescence analysis. Retroorbital venous plexus puncture 
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was utilized to obtain circulating leukocytes. Leukocytes (0.5 x 10 6 ) were 
stained at 4°C using predetermined optimal concentrations of the test 
monoclonal antibody for 20 min as described (Zhou et al. (1994) Mol Cell 
Biol 14:3884-3894). Blood erythrocytes were lysed after staining using the 
5 Coulter Whole Blood Immuno-Lyse kit as detailed by the manufacturer 
(Coulter, Inc., Miami, Florida). Cells were washed and analyzed on a 
FACScan flow cytometer (Becton Dickinson, San Jose, California). 

Antibodies used in this study included the following: biotin, FITC- 
conjugated anti-B220 Mab (CD45RA, RA-3, 6B2, provided by Dr. Robert 

10 Coffman, DNAXCORP, Palo, Alto, California); PE-conjugated anti-mouse 
Thy1.2 (Caltag Laboratories, Burlingame, California); B220-PE (Caltag 
Laboratories, Burlingame, California); biotin-conjugated anti-l-A (BD 
PharMingen, Franklin Lakes, New Jersey); PE or APC-conjugated anti-CD5 
(BD PharMingen); PE-conjugated goat anti-mouse lgG3-specific antibody 

15 (Southern Biotechnology Associates Inc., Birmingham, Alabama); and biotin- 
conjugated anti-mouse IgD (Southern Biotechnology Associates Inc., 
Birmingham, Alabama). FITC or biotin-conjugated goat anti-mouse IgM 
isotype-specific antibodies (Southern Biotechnology Associates Inc., 
Birmingham, Alabama) were also used. 

20 Phycoerythrin-conjugated Streptavidin (Southern Biotechnology 

Associates Inc., Birmingham, Alabama) was used to reveal biotin-coupled 
monoclonal antibody staining. The percent positively stained lymphocytes 
was determined using a FACScan flow cytometer (Becton Dickinson, San 
Jose, California). Positive and negative populations of cells were 

25 determined by using unreactive monoclonal antibody (Caltag Laboratories, 
Burlingame, California) as controls for background staining. Background 
levels of staining were delineated using gates positioned to include 98% of 
the control cells. Ten thousand cells with the forward and side light scatter 
properties of lymphocytes were analyzed for each sample. 
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Example 8 

Intracellular Calcium Measurements 

Changes in lymphocyte [Ca 2+ ]j levels were monitored by flow 
cytometry analysis as described (Fujimoto et al. (1999) Immunity 11:191). 
5 Single cell suspension of splenocytes were resuspended (1 x 10 7 /ml) in 
RPMI 1640 medium containing 5% FBS, 10 mM HEPES and loaded with 
1jj.M of indo-1-AM for 30 min at 37°C. Splenocytes were then washed and 
incubated with a predetermined optimal concentration of FITC-conjugated 
anti-B220 monoclonal antibody for 15 min at room temperature. The 

10 splenocytes were washed again and resuspended at 2 x 10 6 /ml in medium. 
The fluorescence ratio (405/525 nm) of B220* splenic B cells was monitored 
by flow cytometry at baseline for 1 min and for 6 min after stimulation with 
optimal and suboptimal concentrations of goat F(ab')2 anti-IgM antibody (5- 
40 jxg/ml), optimal concentrations of anti-mouse CD19 monoclonal antibody 

15 (40 ng/ml), Thapsigargin (1 |ig/ml; Sigma), or lonomycin (2.67 jig/ml; 
Calbiochem Biosciences, Inc., La Jolla, California). In some cases, EGTA (5 
mM final; pH 7.0) was added to the cells, immediately followed by 
stimulation with the inducing agents described above. Results were plotted 
as the fluorescence ratio at 20 sec intervals with background fluorescence 

20 subtracted. An increase in the fluorescence ratio indicates an increase in 
[Ca 2+ ]i. 

Example 9 

Characterization of a MS4A Promoter Region 
A preferred in vitro technique for evaluating MS4A promoter function 
25 is a transient transfection assay. According to this method, one or more 
chimeric reporter genes comprising a MS4A promoter region is introduced 
into a relevant host cell (e.g., a hematopoietic cell), and the resulting level of 
reporter gene expression is quantitated. Representative methods for making 
an expression system comprising a promoter region operably linked to a 
30 heterologous reporter sequence are disclosed in U.S. Patent No. 6,087,1 11. 

To analyze the function of a MS4A promoter region in vivo, transgenic 
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mice bearing a chimeric gene comprising a MS4A promoter region are 
generated, and a level of reporter gene expression in each mouse is 
determined. 

Within a candidate promoter region or response element, the 
5 presence of regulatory proteins bound to a nucleic acid sequence can be 
detected using a variety of methods well known to those skilled in the art 
(Ausubel et al., 1992). Briefly, in vivo footprinting assays demonstrate 
protection of DNA sequences from chemical and enzymatic modification 
within living or permeabilized cells. Similarly, in vitro footprinting assays 
10 show protection of DNA sequences from chemical or enzymatic modification 
using protein extracts. Nitrocellulose filter-binding assays and gel 
electrophoresis mobility shift assays (EMSAs) track the presence of 
radiolabeled regulatory DNA elements based on provision of candidate 
transcription factors. Computer analysis programs, for example TFSEARCH 
15 version 1.3 (Yutaka Akiyama: 'TFSEARCH: Searching Transcription Factor 
Binding Sites", http://www.rwcp.or.jp/papia/), can also be used to locate 
consensus sequences of known cis-regulatory elements within a genomic 
region. 



20 



WO 02/062946 



87 



PCT/US01/48437 



References 

The publications and other materials listed below and/or set forth in 
the text above to illuminate the background of the invention, and in particular 
cases, to provide additional details respecting the practice, are incorporated 
5 herein by reference. Materials used herein include but are not limited to the 
following listed references. 

Adelman et al., (1983) DNA 2:183-193. 

Adra et al. (1994) Proc Natl Acad Sci USA 91:10178-10182. 

Adra et al. (1999) Clin Genet 55:431-437. 

1 0 Aebi and Weissmann (1 987) Trends Genet 3:1 02-1 07. 

Alam & Cook (1990) Anal Biochem 188:245-254. 

Altschul et al. (1990) J Mol Biol 21 5:403-410. 

Altschul et al. (1997) Nucleic Acids Res 25:3389-3402. 

Anderson et al. (1984) Stood 63:1424. 

15 Ausubel et al. (1992) Current Protocols in Molecular Biology . John 

Wylie and Sons, Inc., New York, New York. 

Barton (1 998) Acta Crystallogr D Biol Crystallogr 54: 1 1 39-1 1 46. 

Batzer et al. (1991) Nucleic Acids Res 19:3619-3623. 

Blank et al. (1989) Nature 337:187-189. 

20 Bodanszky, et al. (1 976) Peptide Synthesis . John Wiley and Sons, 

Second Edition, New York, New York. 

Bubien et al. J Cell B/o/ 121:1121-1132. 

Conner et al. (1983) Proc Natl Acad Sci USA 80:278-282. 

Cubitt et al. (1995) Trends Biochem Sci 20:448-455. 

25 Dietrich et al. (1996) Nature 380:149-152. 

Dombrowicz et al. (1998) Immunity 8:517-529. 



WO 02/062946 



88 



PCT/US01/48437 



Einfeld etal. (1988) EMBO J 7:71 1-71 7. 

Fujimoto et al. (1999) Immunity 1 1 :191 . 

Furumoto et al. (2000) Biochem Biophys Res Com 273:765-771 . 

Glover, ed. (1985) DNA Cloning: A Practical Approach. MRL Press, 
5 Ltd., Oxford, United Kingdom. 

Gorman etal. (1996) Immunity 5:241-252. 

Henikoff et al. (2000) Electrophoresis 21 (9): 1700-1 706. 

Henikoff & Henikoff (1989) Proc Natl Acad Sci USA 89:10915. 

Henikoff & Henikoff (2000) Adv Protein Chem 54:73-97. 

10 Harlow & Lane (1988^ Antibodies: A Laboratory Manual. Cold Spring 

Harbor Laboratory Press, Cold Spring Harbor, New York. 

Huang et al. (2000) Pac Symp Biocomput 230-241 . 

Hupp et al. (1989) J Immunol 143:3787-3791. 

Hutchens & Yip (1993) Rapid Commun Mass Spectrom 7: 576-580. 
15 Kanzaki et al. (1997a) J Biol Chem 272:14733-14739. 

Kanzaki et al. (1997b) J Biol Chem 272:4964^1969. 

Kanzaki et al. (1995) J Biol Chem 270:13099-13104. 

Karlin & Altschul (1993) Proc Natl Acad Sci USA 90:5873-87. 

Kinet (1999) Annu Rev Immunol 17:931-972. 
20 Kinet et al. (1 988) Proc Natl Acad Sci USA 85:6483-6487. 

Keller & Smithies (1989) Proc Natl Acad Sci USA 886:8932. 

Kozak (1986) Cell 44:283-292. 

Kuster et al. (1992) J Biol Chem 267:12782-12787. 

Kyte et al. (1982) J Mol Biol 1 57:105. 
25 Lander & Botstein (1989) Genetics 121 :185-199. 

Landgren et al. (1988) Science 241:1007. 



WO 02/062946 



89 



PCT/US01/48437 



Landgren et al. (1988) Science 242:229-237. 

Latorra et al. (1994) PCR Methods Appl 3(6):351-358. 

Li & Herskowitz (1993) Science 262:1870-1874. 

Liedberg et al. (1983) Sensors Actuators 4:299-304. 
5 Lin et al. (1996) Cell 85:985-995. 

Luckow & Schutz (1987) Nucleic Acids Res 15:5490. 

Luo et al. (1996) Biotechniques 20(4):564-568. 

Luyckx et al. (1999) Proc Natl Acad Sci USA 96(21 ):1 2174-121 79. 

Madge et al. (1972) Phys Rev Lett 29:705-708. 
10 McLaughlin et al. (1998) Oncology 12:1763-1769. 

Maiti et al. (1997) Proc Natl Acad Sci USA, 94:1 1753-1 1757. 

Malmquist (1993) Nature 361 :1 86-1 87. 

Mohan et al. (1999) 1999 103:1685-1695. 

Needleman & Wunsch (1970) J Mol Biol 48:443-453. 
15 Ohtsuka et al. (1985) J Biol Chem 260:2605-2608. 

Onrust et al. (1989) J Biol Chem 264:15323-15327. 

Pearson & Lipman (1988) Proc Natl Acad Sci USA 85: 2444-2448. 

Postic et al. (1 999) J Biol Chem 275(1 ):305-31 5. 

Ra etal. (1989) Nature 19:1771-1777. 
20 Rose & Botstein (1983) Meth Enzymol 101 :167-180. 

Rossolini et al. (1994) Mol Cell Probes 8:91-98. 

Saiki et al. (1985) Bio/Technology 3:1008-1012. 

Sambrook et al. eds. (1 989) Molecular Cloning: A Laboratory Manual. 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York. 

25 Sauer (1998) Methods 14(4):381-392. 



WO 02/062946 



90 



PCT7US01/48437 



Saqi et al. (1999) Bioinformatics 15:521-522. 
Schalkwyk et al. (1999) Genome Res 9:878-887. 
Sieghart et al. (1999) Neurochem Int 34:379-385. 
Silhavy et al. (1984) Experiments with Gene Fusions . Cold Spring 
5 Harbor Laboratory, Cold Spring Harbor, New York. 

Singh et al. (1989) Biotechniques 7:252-261 . 
Smith & Waterman (1981) AdvAppI Math 2:482. 
Stamenkovic & Seed (1988) J Exp Med 167:1975-1980. 
Stashenko et al. (1980) J Immunol 125:1678-1685. 
10 Tedder & Engel (1994) Immunol Today 15:450-454. 

Tedder et al. (1988a) J Immunol 141 -.4388-4394. 
Tedder et al (1988b) Proc Natl Acad Sci USA 85:208-212. 
Tedder et al. (1989a) J Immunol 142:2555-2559. 
Tedder et al. (1989b) J Immunol 142:2560-2568. 
1 5 Tijssen (1 993) Laboratory Techniques in Biochemistry and Molecular 

Bioloav-Hvbridization with Nucleic Acid Probes , part I chapter 2, Elsevier, 
New York, New York. 

U.S. Patent No. 4,196,265 
U.S. Patent No. 4,554,101 
20 U.S. Patent No. 4,736,866 

U.S. Patent No. 5,162,215 
U.S. Patent No. 5,234,933 
U.S. Patent No. 5,260,203 
U.S. Patent No. 5,326,902 
25 U.S. Patent No. 5,489,742 

U.S. Patent No. 5,550,316 



WO 02/062946 



91 



PCT/US01/48437 



U.S. Patent No. 5,573,933 
U.S. Patent No. 5,614,396 
U.S. Patent No. 5,625,125 
U.S. Patent No. 5,648,061 
5 U.S. Patent No. 5,741,957 

U.S. Patent No. 6,087,111 

Vidal et al. (1996) Proc Natl Acad Sci USA 93(1 9):1 031 5-1 0320. 
Weiner (1999) Semin Oncol 26:43-51 . 
Whiting (1999) Neurochem Int 34:387-390. 
10 WO 93/25521 

WO 97/47763 

Worrall et al. (1998) Anal Biochem 70:750-756. 

Zhou et al. (1994) Mol Cell Biol 14:3884-3894. 

Zhou & Tedder (1995) Blood 86:3295-3301 . 

1 5 Zimmer et al. (1 993) Peptides, pp. 393-394, ESCOM Science 

Publishers, B.V. 

It will be understood that various details of the invention can be 
changed without departing from the scope of the invention. Furthermore, the 
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CLAIMS 

What is claimed is: 

1. An isolated MS4A polypeptide, or functional portion thereof, 
comprising: 

5 (a) a polypeptide encoded by the nucleotide sequence of 

any one of the odd-numbered SEQ ID NOs:1-37; 

(b) a polypeptide encoded by a nucleic acid molecule that is 
substantially identical to any one of the odd-numbered 
SEQIDNOs:1-37; 

10 (c) a polypeptide having the amino acid sequence of any 

one of the even-numbered SEQ ID NOs:2-38; 

(d) a polypeptide that is a biological equivalent of the 
polypeptide of any one the even-numbered SEQ ID 
NOs:2-38; or 

15 (e) a polypeptide which is immunologically cross-reactive 

with an antibody that shows specific binding with a 
polypeptide of any one of the even-numbered SEQ ID 
NOs:2-38. 

2. An isolated nucleic acid molecule encoding a MS4A 
20 polypeptide, comprising: 

(a) the nucleotide sequence of any one of the odd- 
numbered SEQ ID NOs:1-37; or 

(b) a nucleic acid molecule substantially identical to any 
one of the odd-numbered SEQ ID NOs:1-37. 

25 3. The isolated nucleic acid molecule of claim 2, comprising a 20 

nucleotide sequence that is identical to a contiguous 20 nucleotide sequence 
of any one of the odd-numbered SEQ ID NOs:1-37. 

4. A chimeric gene, comprising the nucleic acid molecule of claim 
2 operably linked to a heterologous promoter. 
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5. A vector comprising the chimeric gene of claim 4. 

6. A host cell comprising the chimeric gene of claim 4. 

7. The host cell of claim 6, wherein the cell is selected from the 
group consisting of a bacterial cell, a hamster cell, a mouse cell, and a 

5 human cell. 

8. A method of detecting a nucleic acid molecule that encodes a 
MS4A polypeptide, the method comprising: 

(a) procuring a biological sample comprising nucleic acid 
material; 

10 (b) hybridizing the nucleic acid molecule of claim 2 under 

stringent hybridization conditions to the biological 
sample of (a), thereby forming a duplex structure 
between the nucleic acid of claim 2 and a nucleic acid 
within the biological sample; and 

15 (c) detecting the duplex structure of (b), whereby a MS4A 

nucleic acid molecule is detected. 

9. An antibody that specifically recognizes a MS4A polypeptide of 
claim 1 . 

10. A method for producing an antibody that specifically 
20 recognizes a MS4A polypeptide, the method comprising: 

(a) recombinantly or synthetically producing a MS4A 
polypeptide, or portion thereof; 

(b) formulating the polypeptide of (a) whereby it is an 
effective immunogen; 

25 (c) administering to an animal the formulation of (b) to 

generate an immune response in the animal comprising 
production of antibodies, wherein antibodies are present 
in the blood serum of the animal; and 
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(d) collecting the blood serum from the animal of (c), the 
blood serum comprising antibodies that specifically 
recognize a MS4A polypeptide. 

11. A method for detecting a level of MS4A polypeptide, the 
5 method comprising 

(a) obtaining a biological sample comprising peptidic 
material; and 

(b) detecting a MS4A polypeptide in the biological sample 
of (a) by immunochemical reaction with the antibody of 

10 claim 9, whereby an amount of MS4A polypeptide in a 

sample is determined. 

12. A method for identifying a substance that modulates MS4A 
function, the method comprising: 

(a) isolating a MS4A polypeptide of claim 1 ; 
1 5 ( b ) exposing the isolated MS4A polypeptide to a plurality of 

substances; 

(c) assaying binding of a substance to the isolated MS4A 
polypeptide; and 

(d) selecting a substance that demonstrates specific 
20 binding to the isolated MS4A polypeptide. 

13. A method for modulating MS4A function in a subject, the 
method comprising: 

(a) preparing a pharmaceutical composition, comprising a 
substance identified according to the method of claim 10 

25 or 1 2, and a carrier; and 

(b) administering an effective dose of the pharmaceutical 
composition to a subject, whereby MS4A activity is 
altered in the subject. 
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14. The method of claim 13, wherein the substance is an antibody, 
a protein, a peptide, or a chemical compound. 

15. The method of claim 13, wherein MS4A activity is regulation of 
the abundance of target cell subpopulations. 

5 16. The method of claim 13, wherein MS4A activity is regulation of 

[Ca 2+ ]j levels. 

17. A method for identifying a candidate compound as a modulator 
of MS4A gene expression, the method comprising: 

(a) exposing a cell sample with a candidate compound to be 
10 tested, the cell sample containing at least one cell containing a 

DNA construct comprising a modulatable transcriptional 
regulatory sequence of a MS4A-encoding nucleic acid and a 
reporter gene which is capable of producing a detectable 
signal; 

15 (b) evaluating an amount of signal produced in relation to a control 

sample; and 

(c) identifying a candidate compound as a modulator of MS4A 
gene expression based on the amount of signal produced in 
relation to a control sample. 

20 18. The method of claim 17, wherein the modulatable 

transcriptional regulatory sequence of a MS4A-encoding nucleic acid 
comprises a sequence that is immediately upstream of the initial coding 
region of a MS4A gene as set forth in any one of SEQ ID NOs:73-81 . 

19. A method for modulating MS4A function in a subject, the 
25 method comprising: 

(a) preparing a gene therapy vector having a nucleotide 
sequence encoding a MS4A polypeptide or a nucleotide 
sequence encoding a nucleic acid molecule, peptide, or 
protein that interacts with a MS4A nucleic acid or 
30 polypeptide; and 
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(b) administering the gene therapy vector to a subject, 
whereby the function of MS4A in the subject is 
modulated. 
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ATCTCCCCAA TGQTCCCCAA CTGCGCCAAT AGTCTCTGAC AGAAGATACA CACTATAAAA TGCTCTTCAT 



A4E CTCTGACAAG ATTAGATCCT GTGCTCGCTT TCTTGCTATT CTTCTAACCT GGTCATCGCC CCCCGACTCC AAC 3 
A4A GTCTCAGAAO ATTAGATGAT CTGGTGGGTT TGTTTTTCTT CTTACAAGGT GGTCA CA AGAGGGGTCC (SXcj 3 
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A4E 
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A4E 
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A4E 
A4A 



ATT CCT CCA AAC OGC CTC AAC ATA TGC AAA TCT CTC AAT ATA OCA ATC AAA TTA CTT 
ATT CCT GCA AAT OCT TTC AAT ATA TGC AGA TGT CTC GAT ATA CCA ATG AAA TTA CCT CTT 



A4E TAG AAC AAC TTA AAT AAC TCC AAT ATA CTT CCG CCT TTA AAA AAT AAA AAC GAG AAA TTC 120 

A4A TGC AAC AAC TTA AAT AAG TCA AAT ATA CTT GCA CCT TTA AA AAT TAA AAC GAG AGA CAT 122 

A4E MTTMQGMEOTTPC 

A4E AAG | ACC ACC TTT TCT CCT GCC ATG ACA ACC ATC CAA GCA ATG CAA CAG ACC ACT CCA CCG 180 

A4A TCC AG|C ACC TTT TCT CCT GCC ATG ACA ACC ATG CAA CCA ATC GAA CAG GCC ATG CCA GCC 182 
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A4A 

A4E 
A4E 
A4A 
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A4E 
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A4E 
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A4A 

A4E 
A4E 
A4A 

A4E 
A4E 
A4A 

A4E 
A4E 
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PGPDVPQLGNIDVIHSYLCK 

CCT GCC CCT CAT CTC CCC CAG CTC GCA AAC ATA CAT CTC ATA CAT TCA TAT CTC TGT AAA 240 

CCT CCC CCT GOT CTC CCC CAG CTC GCA AAC ATG CCT GTC ATA CAT TCA CAT CTC TGC AAA 1C2 

GLQEKFFKftKPKVLGVVRI L 

GCA TTC CAA CAG AAG TTC TTC AAC AGO AAA CCC AAA CTC CTT CCC | CTT GTC CCG ATT CTC 300 

GGA TTC CAA GAG AAC TTC TTG AAG GCA GAA CCC AAA GTC CTT GCC I CTT GTC CAG ATT CTC 302 

IALMSLSMCIIHHCVAFSSY 

ATT CCC TTG ATC ACC CTT ACC ATC CCA ATA ATA ATC ATC TGT CTT GCA TTT ACT TCT TAT 360 

ACT CCC CTC ATG ACC CTT AGC ATC CCA ATA ACA ATG ATC TGT ATC GCA TCT AAT ACT TAT 362 

EEHPIFVYVAYTIWGSVMFI 

CAA CAA CAT CCC ATT TTT CTC TAT CTT CCC TAC ACA ATT TCC GCC TCA CTC ATC I TTT ATT 420 

CCA ACT AAC CCT ATT TCC GTC TAT ATC CCC TAC ACA ATT TGC CCC TCA CTA ATG I TTT ATT 422 

ISVSLSVAAGIRTTXGtVCO 

ATT TCA CTA TCC TTA TCA GTT CCA GCA GCA ATT AGA ACA ACA AAA GOT CTGjGTT CCA CCT 4B0 

ATT TCA GCA TCC TTG TCA ATT CCA GCA GCA ATT AGA ACT ACA AAA GCC CTC | GTC CCA CCT 482 



SLCKNITSSViAISGlLINA 

ACT CTA GGA AAG AAT ATC ACC ACT TCA CTC TTG CCT ATA TCA CCC ATC TTA ATC AAT CCA S40 

ACT CTA GCA ATG AAT ATC ACC AGC TCT CTA CTC CCT GCA TCA GCC ATC TTA ATC AAC ACA 542 

IS LTFY SPRYHYCNHDQLSS 

ATA ACC TTG A CO TTT TAT TCA TTC CCT TAC CAT TAC TGT AAC CAC GAT CAG TTG TCA ACT 600 

TTT ACC TTG CCC TTT TAT TCA TTC CAT CAC CCT TAC TGT AAC TAC TAT CCC AAC TCA AAT 602 

NCYMTMS ILMGTDCHVLL LS 

AAT TGT TAC ATG ACT ATG TCC ATT TTA ATG t CCT ACC CAT GCC ATC CTC CTC CTC TTA ACT 660 

AAT TCT CAT CCG ACT ATG TCC ATC TTA ATClGCT CTC CAT GGC ATC GTC CTC CTC TTA ACT 662 

VLEFCIAVSLSAPCCKVLCC 

CTC CTC CAA TTC TCC ATT CCT CTC TCC CTC TCT CCC TTT CCA TCT AAA GTC CTT TCT TCT 720 

GTC CTG GAA TTC TCC ATT GCT CTC TCC CTC TCT CCC TTT GGA TCT AAA CTC CTC TGT TCT 722 

A4E SPSEFVL1TPSNSHMAEIAS 

A4E AGC CCC ACT GAG I TTT CTC CTA ATT ACA CCA TCA AAT TCT CAC ATC CCA CAA ATA CCA TCT 780 

A4A ACC CCT GCT CCC (CTT GTG TTA ATT CTC CCA TCA CAT TCT CAC ATG GCA GAA ACA GCA TCT 782 

A4E PTPLKT V* 

A4E CCC ACA CCA CTT AAG ACA CTT TCA TGCCACCAAA ACATTAACAC AAGAATGCTC CACAAATCTA 844 

A4A CCC ACA CCA CTT AAT CAC GTT TCA CCCCACCAAA ACATCAACAG ACAAATGCTC CACAAATCTA 846 

A4E TCCTCACTCT AACACAACAA CC CACATCA CAAAGTACCA GAATCCAACT CCAATACTCA TACACATATT 913 

A4A TCCTCACTCT GACACAAGAC CCTCACATCA CAAATTACCA GTATCCAACT TCCATACTGA TACACTTCTT 916 

A4E GAT ATC ATT A TTATATCGAA TCCAATTATG ACCTCTCTCT GTGTGTGTCT CTGTATATAT ATATATACAT 983 

A4A CATATTATTA TTATATGTAA TCCAATTATG AACTGTGTCT CTATA 961 

A4E ATATATATAT ATACATATAT ATATATCTGT CTCTCTG TAT ATAT TCAAAA TTTTGTTCTC ATTTTTTCCC 1053 

A4A CACACA TfrAT AAAfr TCAAAA TTATGTTCTC ATTTTTTTCC 1001 

A4E CTGCAACTCA ACAACTAATT TCATTCCCCC TTTATCCACA GTACTAGAAC TTAAAT T^AT AAAf TAATGCA 1123 

A4A CTGCAACTCA ATAACTCATT TCACTCGCTC TTTATCCACA CTACTAGAAG TTAAATt£a?j5&AATGCA 1071 



TTTAATGAGC CAGCACCACT TGAAACGTTT TCATTCATCA TTACCACTTT ATATAAACGC ATTAAACTGGC 
TTTAATGACC CAACACCACT TGAAAGTTTT TCATTCATCA TAACAACTTT ATATAAACGC ATTACATTCGC 

AAATAAGATT TCCAAGCAGA AGGGCAAAA CCTATTG CTAAAACGAG CTCTCCATCC AAA CACATA 

AAATAAGCTT TCCAACCACA ACAGCAAAAA AAACATATTG TTAAAATCAG GCCTCCATCC AAAACACATA 

CTTCTCCTCC CCTCTATAAC ATTCCTCTCA CTTACTTCAC TTTTTT fl ' lU CCATATTTCC GCACCAAACT 
CTTCC CTCC C ATTTAT TTA ACTT rTTTTTTCTC CTACCTATCC GCACCAAACT 

GC TtTn- CC 'l TCATCAAGTG GAGATTCATG CCCTTCTCCC a tHUTIT CCTTCT GCTTTCCTTC 
CCTTTTTCCT TCAGCAACTG GAGATCCATC CCCATCTCCC CCTCCCTTTT TCCTTCTCCT GCTTTTCTTT 

ACCCATAGAA ACTACCTTCC AATAGTATAC TCACTCCTTG CATCTCCACA AGCTATCATT TCACTAAACC 
CCCCATAGAA ACTACCTTCA ACTACCACAC TCCCTCCTTC CATCTCCACG AGCTATCATT TCACTAAAAC 

TATACATCGA GTAAAAATCA TATGAAGGAT CAGATTCAAC TTATATTTTC TATTTTTTCT r i n il T CTC 
TATACATCGA GTAAAAATCA TATTAAGCAT CAGATTCAAC TTATATTTTC TATTTCATCT TCTTCCTTTC 



CCTTCCCCCA CCnt.l tX -r G GGCAGAATTA TATCTTAATC AAATCTCTAT CCTCTCTCAC ATATCGAAAT 
CCTTCTCCCA CCTTCT ACTG GGCATAATTA TATCTTAATC ATATATGCAA ATCTGCAAC ATATCCTATT 
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Figure 7 
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CLQSKELRPTEYHYYQ FLDRNE 170 
CKQSKELSLIEHDYYQPPYNSDRSE 173 
CKLAFTOLDTTODAYHFFSPEPLNS 173 
CKLDEKDI PTRLLLSYDYHSPYT KD 109 
TASQHCGSEMDYLSSLPYSEYYYP I YE I KD 173 
SAFPHCNSEKKFLSLLSYLKSHHWKNEDKN 170 
PYAYPDYYPY 174 
GYI YPSYYPYQE 192 
PFESPIWRMYPNST 164 
PFPWPIWRPYPEPT 161 
VAGQD 192 
LPEQD 189 



{ TM4 J 

CYSIQSLFL| GI LSVMLIPAFFQELVIACI VENEVKRTCSRPK5 1 NI VLLSAEEKXEC^I EIKEEVVGLTETSSQPKNEEDI EI I PIQEEEEEETETNFPEPPQDQES5PIENDSSP297 
CNSIQSVFL G I LSAML I SAFFQKLVTAG I VENEWKRMCTRS KS NWLLSAGEKNEQTI KMKEE I IELSGVSSQPXNEEEIEI I PVQEEEEEEAEI NPPAPPQEQESLPVENEI AP291 



CFMASPS|T EI WMMLFLTI LGLGSAVSLTICGAGEELKGNK| VPEORVYEELN I YSATYSELEDPGEMSPP I DL 24 4 

a2 C FVASFTT ELVLMMLFLTILAFCSAVLFTI YRIGQELESKK VPDDRLYEELNVYSPIYSELEDXCETSSPVDS 235 

A3 CNYMGSISN GMVS LLL I LTLLELCVTI ST I AMWCNANCCNSRE EISSPPNSV 214 

A4A CHGTMSI LM I CUX>WLLLSVI^FCI AVSL5AFGCKVLCCTPGG | WLILPSHSKMAETASPTPLNEV 220 

a4B FRYNYT ITK | GLDVLMLI PNMLEFCLAVSVSAFGCEASCCNSRE | VLVVLPSNFWTVMAPPMTU2PLLPSEH<XrimfPanry-KK^PGEIV . 226 

a4C FRYNYT ITK GLDILMLILNMLEFCIAVSI SAFGCKASCCNSSE| VLV^PSOTAVTWAPPWLOPLPPSEKQGKIWPENVYKNHSEEIV 226 

a4D QFRSQPAIA SLDVLMTILhWLEFCI AVSVSAFGCKASCCNSSE VLWLPSNSAVTVTAPPMI LQPLPPSECOGKNVPENLYRNQPGEIV 225 

A4E cymtmsilmIgttx;mvlllsvlefciavslsafgckvlccspse| FVLITPSNSHMAEIASPTPLKTV 220 

A5 CKAVTVLPL GILITLMTPS! IF^PISLPPSIUXMSEDCDCEQCC 200 

A6A CYTAKASLA [ GTLSLMLICTLLEFCLAVLTAVLRWKOAYSOFPG | SVLFLPHSY IGNSGMSSKMTHDCGYEELLTS 24 8 

a6B CFAAKSVLA GVFSU^LISTMLELGLAVLTAMLWWKOSHSNI PG NVMFLPHSSNNDSNMESKVLCNPSYEEQLVC 244 

a6C CAVTKSILT GALSVMLI ISVLELGLALLSAMLWtRECVLTSLRM 217 

aSD CFVAKAALT GVPSLWL ISSVLELG LAVLTATLWNKOSSSAFSG NVIFLSONSKNKSSVSSESLCNPTYENILTS 247 

A6E CKRAKASLA I GTLSLMLVSTVl^FCLAVLTAVLCV'KGTV 147 

A7 CLLTSVSLTl GVLVVMLI FTVLELLUUVYSSVFWWKOLYSNNPG | SSFSSTQSQDHIQOVKKSSSRSWI 240 

a7 CYLAYVGAM SALGMMLLPTVLEVFLAGYSSI FWWKOVYSNKPG GTFFLPOSQDHTQLVKSNLLQ 234 

A8B AWGV | N PGMA I SGVLLV FC I.LEPG I ACAS SHPGCQLVCCQS SN | VSVIYP NI YAANPVITPEPVTSPPSYSSEIQANK 250 

aSB EMLCV RTGVAI SSVLLI FCLLELSI ASVSSHFGCOVACCHYNN PGWI P| NVYAANPWIPEPPNPIPSYSEWQOSR 268 

AlO | VH I ORLELALLCFTVLELFLPVPTAVTAKRCaX!PSAK | NDOACLVPNT P LHLKC L PVEP PPS YQSV I QCDAQH KQHQR 241 

alO TYIOPO.ELTLFCFTFI^IFLSCSTAITAYRMKRLQAFJDKDirrPFVPirrPMELKGl^LCPPPSY 267 

A12 DYWAV | LSGKG I SATLM I FSLLEFFVACATAHFANOANTTTNM | SVLVI PNMYESNPVTPASSSAPPRCNNYSANAPK 267 

al2 DYWAV LSGKGISAMLI IFSLLEFCI TCVTAYFASHTITNTRGL SWSFHLCMQTVP 243 
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Sequence Listing 



<110> Tedder, Thomas 
Liang, Yinghua 

<12 0> Identification of Novel MS4A Gene Family Members Expressed by 
Hematopoietic Cells 

<130> 180/132 

<160> 81 

<170> Patentln version 3.0 

<210> 1 

<211> 1619 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> cDNA 

<222> (1)..(1619) 

<220> 

<221> CDS 

<222> (144) . . (806) 



<400> 1 

aacattcctg caaatggttt caatatatgc agatgtctcg atataggaat gaaattacgt 



60 



ctttggaaca acttaaataa gtcaaatata cttggagctt taaaaattaa aaggagagag 



120 



attcgagcac cttttctgct gcc atg aca acc atg caa gga atg gaa cag gcc 

Met Thr Thr Met Gin Gly Met Glu Gin Ala 
15 10 



173 
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atg cca ggg get ggc cct ggt gtg ccc cag ctg gga aac atg get gtc 221 
Met Pro Gly Ala Gly Pro Gly Val Pro Gin Leu Gly Asn Met Ala Val 
15 20 25 

ata cat tea cat ctg tgg aaa gga ttg caa gag aag ttc ttg aag gga 269 
He His Ser His Leu Trp Lys Gly Leu Gin Glu Lys Phe Leu Lys Gly 
30 35 40 

gaa ccc aaa gtc ctt ggg gtt gtg cag att ctg act gee ctg atg age 317 
Glu Pro Lys Val Leu Gly Val Val Gin He Leu Thr Ala Leu Met Ser 
45 50 55 

ctt age atg gga ata aca atg atg tgt atg gca tct aat act tat gga 365 
Leu Ser Met Gly He Thr Met Met Cys Met Ala Ser Asn Thr Tyr Gly 
60 ' 65 70 

agt aac cct att tec gtg tat ate ggg tac aca att tgg ggg tea gta 413 
Ser Asn Pro He Ser Val Tyr He Gly Tyr Thr He Trp Gly Ser Val 
75 80 85 90 

atg ttt att att tea gga tec ttg tea att gca gca gga att aga act 461 
Met Phe He He Ser Gly Ser Leu Ser He Ala Ala Gly He Arg Thr 
95 100 105 



aca aaa ggc ctg gtc cga ggt agt eta gga atg aat ate acc age tct 
Thr Lys Gly Leu Val Arg Gly Ser Leu Gly Met Asn lie Thr Ser Ser 
110 115 120 



tat tea ttc cat cac cct tac tgt aac tac tat ggc aac tea aat aat 
Tyr Ser Phe His His Pro Tyr Cys Asn Tyr Tyr Gly Asn Ser Asn Asn 
140 145 150 



etc tta agt gtg ctg gaa ttc tgc att get gtg tec etc tct gee ttt 
Leu Leu Ser Val Leu Glu Phe Cys He Ala Val Ser Leu Ser Ala Phe 
175 180 185 



509 



gta ctg get gca tea ggg ate tta ate aac aca ttt age ttg gcg ttt 557 
Val Leu Ala Ala Ser Gly He Leu He Asn Thr Phe Ser Leu Ala Phe 
125 130 135 



605 



tgt cat ggg act atg tec ate tta atg ggt ctg gat ggc atg gtg etc 653 
Cys His Gly Thr Met Ser He Leu Met Gly Leu Asp Gly Met Val Leu 
155 160 • 165 170 . 



701 



gga tgt aaa gtg etc tgt tgt acc cct ggt ggg gtt gtg tta att ctg 749 
Gly Cys Lys Val Leu Cys Cys Thr Pro Gly Gly Val Val Leu He Leu 
190 195 200 

cca tea cat tct cac atg gca gaa aca gca tct ccc aca cca ctt aat 797 
Pro Ser His Ser His Met Ala Glu Thr Ala Ser Pro Thr Pro Leu Asn 
205 210 215 

gag gtt tga ggccaccaaa agatcaacag acaaatgetc cagaaatcta 84 6 

Glu Val 
220 

tgctgactgt gacacaagag cctcacatga gaaattacca gtatccaact tcgatactga 906 
tagacttgtt gatattatta ttatatgtaa tccaattatg aactgtgtgt gtatagagag 966 
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ataataaatt caaaattatg ttctcatttt 


tttccctgga 


actcaataac tcatttcact 


1026 




ggctctttat cgagagtact agaagttaaa 


ttaataaata 


atgcatttaa tgaggcaaca 


1086 


5 


gcacttgaaa gtttttcatt catcataaga 


actttatata 


aaggcattac attggcaaat 


1146 




aaggtttgga agcagaagag caaaaaaaag 


atattgttaa 


aatgaggcct ccatgcaaaa 


1206 


10 


cacatacttc cctcccattt atttaacttt 


ttttttctcc 


tacctatggg gaccaaagtg 


1266 


ctttttcctt caggaagtgg agatgcatgg 


ccatctcccc 


ctcccttttt ccttctcctg 


1326 




cttttctttc cccatagaaa gtaccttgaa 


gtagcacagt 


ccgtccttgc atgtgcacga 


1386 


15 


gctatcattt gagtaaaagt atacatggag 


taaaaatcat 


attaagcatc agattcaact 


1446 




tatattttct atttcatctt cttcctttcc 


cttctcccac 


cttctactgg gcataattat 


1506 


20 


atcttaatca tatatggaaa tgtgcaacat 


atggtatttg 


ttaaatacgt ttgtttttat 


1566 


tgcagagcaa aaataaatca aattagaagc 


aataaaaaaa 


aaaaaaaaaa aaa 


1619 




<210> 2 








25 












<211> 220 










<212> PRT 








30 


<213> Homo sapiens 










<400> 2 








OD 












Met Thr Thr Met Gin Gly Met Glu Gin Ala Met 


Pro Gly Ala Gly Pro 






1 5 


10 


15 




40 


Gly Val Pro Gin Leu Gly Asn Met Ala Val He 


His Ser His Leu Trp 






20 25 


30 




45 


Lys Gly Leu Gin Glu Lys Phe Leu Lys Gly Glu 


Pro Lys Val Leu Gly 




35 40 




4 b 






Val Val Gin He Leu Thr Ala Leu Met Ser Leu 


Ser Met Gly He Thr 






50 55 




ov7 




50 












Met Met Cys Met Ala Ser Asn Thr Tyr Gly Ser 


Asn Pro He Ser Val 






65 70 


75 


80 




55 












Tyr lie Gly Tyr Thr He Trp Gly Ser Val Met 


Phe He He Ser Gly 






85 


90 


95 




60 


Ser Leu Ser He Ala Ala Gly He Arg Thr Thr 


Lys Gly Leu Val Arg 






100 105 


110 





-3- 
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Gly Ser Leu Gly Met Asn He Thr Ser Ser Val Leu Ala Ala Ser Gly 
115 12 0 125 



He Leu He Asn Thr Phe Ser Leu Ala Phe Tyr Ser Phe His His Pro 
130 135 140 



Tyr Cys Asn Tyr Tyr Gly Asn Ser Asn Asn Cys His Gly Thr Met Ser 
145 150 155 160 



lie Leu Met Gly Leu Asp Gly Met Val Leu Leu Leu Ser Val Leu Glu 
165 170 175 



Phe Cys lie Ala Val Ser Leu Ser Ala Phe Gly Cys Lys Val Leu Cys 
180 185 190 



Cys Thr Pro Gly Gly Val Val Leu He Leu Pro Ser His Ser His Met 
195 200 205 



Ala Glu Thr Ala Ser Pro Thr Pro Leu Asn Glu Val 
210 215 220 



<210> 3 

<211> 663 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(663) 



<400> 3 

atg aca acc atg caa gga atg gaa cag acc act cca ggg cct ggc cct 48 
Met Thr Thr Met Gin Gly Met Glu Gin Thr Thr Pro Gly Pro Gly Pro 
15 10 15 

gat gtg ccc cag ctg gga aac ata gat gtc ata cat tea tat ctg tgt 96 
Asp Val Pro Gin Leu Gly Asn He Asp Val lie His Ser Tyr Leu Cys 
20 25 30 

aaa gga ttg caa gag aag ttc ttc aag agg aaa ccc aaa gtc ctt ggg 144 
Lys Gly Leu Gin Glu Lys Phe Phe Lys Arg Lys Pro Lys Val Leu Gly 
35 40 45 
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gtt gtg egg att ctg att gec ttg atg age ctt age atg gga ata ata 192 
Val Val Arg He Leu lie Ala Leu Met Ser Leu Ser Met Gly He He 
50 55 60 

atg atg tgt gtt gca ttt agt tct tat gaa gaa cat ccc att ttt gtg 240 
Met Met Cys Val Ala Phe Ser Ser Tyr Glu Glu His Pro He Phe Val 
65 70 75 80 

tat gtt gcg tac aca att tgg ggg tea gtg atg ttt att att tea gta 288 
Tyr Val Ala Tyr Thr lie Trp Gly Ser Val Met Phe He He Ser Val 
85 90 95 

tec tta tea gtt gca gca gga att aga aca aca aaa ggt ctg gtt gga 336 
Ser Leu Ser Val Ala Ala Gly He Arg Thr Thr Lys Gly Leu Val Gly 
100 105 110 

ggt agt eta gga aag aat ate ace agt tea gtc ttg get ata tea ggg 3 84 

Gly Ser Leu Gly Lys Asn He Thr Ser Ser Val Leu Ala lie Ser Gly 
115 120 125 

ate tta ate aat gca ata age ttg acg ttt tat tea ttc cgt tac cat 432 
He Leu lie Asn Ala lie Ser Leu Thr Phe Tyr Ser Phe Arg Tyr His 
130 135 140 

tac tgt aac cac gat cag ttg tea agt aat tgt tac atg act atg tec 480 
Tyr Cys Asn His Asp Gin Leu Ser Ser Asn Cys Tyr Met Thr Met Ser 
145 150 155 160 

att tta atg ggt acg gat ggc atg gtg etc etc tta agt gtg ctg gaa 528 
He Leu Met Gly Thr Asp Gly Met Val Leu Leu Leu Ser Val Leu Glu 
165 170 175 

ttc tgc att get gtg tec etc tct gee ttt gga tgt aaa gtg ctt tgt 576 
Phe Cys lie Ala Val Ser Leu Ser Ala Phe Gly Cys Lys Val Leu Cys 
180 185 190 

tgt age ccc agt gag ttt gtg eta att aca cca tea aat tct cac atg 624 
Cys Ser Pro Ser Glu Phe Val Leu lie Thr Pro Ser Asn Ser His Met 
195 200 205 

gca gaa ata gca tct ccc aca cca ctt aag aca gtt tga 663 
Ala Glu lie Ala Ser Pro Thr Pro Leu Lys Thr Val 
210 215 220 



<210> 4 

<211> 220 

<212> PRT 

<213> Homo sapiens 



<400> 4 



Met Thr Thr Met Gin Gly Met Glu Gin Thr Thr Pro Gly Pro Gly Pro 
1 5 10 15 
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Asp Val Pro Gin Leu Gly Asn He Asp Val He His Ser Tyr Leu Cys 
20 25 30 



Lys Gly Leu Gin Glu Lys Phe Phe Lys Arg Lys Pro Lys Val Leu Gly 
35 ' 40 45 



Val Val Arg He Leu He Ala Leu Met Ser Leu Ser Met Gly He He 
50 55 60 



Met Met Cys Val Ala Phe Ser Ser Tyr Glu Glu His Pro He Phe Val 
65 70 75 80 



Tyr Val Ala Tyr Thr He Trp Gly Ser Val Met Phe He He Ser Val 
85 90 95 



Ser Leu Ser Val Ala Ala Gly He Arg Thr Thr Lys Gly Leu Val Gly 
100 105 HO 



Gly Ser Leu Gly Lys Asn He Thr Ser Ser Val Leu Ala He Ser Gly 
115 120 125 



He Leu He Asn Ala He Ser Leu Thr Phe Tyr Ser Phe Arg Tyr His 
130 135 140 



Tyr Cys Asn His Asp Gin Leu Ser Ser Asn Cys Tyr Met Thr Met Ser 
145 150 155 160 



He Leu Met Gly Thr Asp Gly Met Val Leu Leu Leu Ser Val Leu Glu 
165 170 175 



Phe Cys He Ala Val Ser Leu Ser Ala Phe Gly Cys Lys Val Leu Cys 
180 185 190 



Cys Ser Pro Ser Glu Phe Val Leu He Thr Pro Ser Asn Ser His Met 
195 200 205 



Ala Glu He Ala Ser Pro Thr Pro Leu Lys Thr Val 
210 215 220 



<210> 5 
<211> 694 
<212> DNA 



<213> Homo sapiens 
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<220> 

<221> CDNA 
<222> (1)..(694) 



<220> 

<221> CDS 

<222> (55) . . (657) 



<400> 5 

ctagactgaa gtaccaacta aatcatctcc tttcaaatta tcaccgacac catc atg 57 

Met 
1 

gat tea age acc gca cac agt ccg gtg ttt ctg gta ttt cct cca gaa 105 

Asp Ser Ser Thr Ala His Ser Pro Val Phe Leu Val Phe Pro Pro Glu 
5 10 15 

ate act get tea gaa tat gag tec aca gaa ctt tea gcc acg acc ttt 153 

lie Thr Ala Ser Glu Tyr Glu Ser Thr Glu Leu Ser Ala Thr Thr Phe 
20 25 30 

tea act caa age ccc ttg caa aaa tta ttt get aga aaa atg aaa ate 201 

Ser Thr Gin Ser Pro Leu Gin Lys Leu Phe Ala Arg Lys Met Lys lie 
35 40 45 

tta ggg act ate cag ate ctg ttt gga att atg acc ttt tct ttt gga 24 9 

Leu Gly Thr lie Gin lie Leu Phe Gly lie Met Thr Phe Ser Phe Gly 

50 55 60 65 

gtt ate ttc ctt ttc act ttg tta aaa cca tat cca agg ttt ccc ttt 2 97 

Val He Phe Leu Phe Thr Leu Leu Lys Pro Tyr Pro Arg Phe Pro Phe 
70 75 80 

ata ttt ctt tea gga tat cca ttc tgg ggc tct gtt ttg ttc att aat 345 

He Phe Leu Ser Gly Tyr Pro Phe Trp Gly Ser Val Leu Phe He Asn 
85 90 95 

tct gga gcc ttc eta att gca gtg aaa aga aaa acc aca gaa act ctg 393 

Ser Gly Ala Phe Leu He Ala Val Lys Arg Lys Thr Thr Glu Thr Leu 
100 105 110 

ata ata ttg age cga ata atg aat ttt ctt agt gcc ctg gga gca ata 441 

He He Leu Ser Arg He Met Asn Phe Leu Ser Ala Leu Gly Ala He 
115 120 125 

get gga ate att etc etc aca ttt ggt ttc ate eta gat caa aac tac 489 

Ala Gly He He Leu Leu Thr Phe Gly Phe He Leu Asp Gin Asn Tyr 

130 135 140 145 

att tgt ggt tat tct cac caa aat agt cag tgt aag get gtt act gtc 537 

lie Cys Gly Tyr Ser His Gin Asn Ser Gin Cys Lys Ala Val Thr Val 
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150 155 160 

ctg ttc ttg gga att ttg att aca ttg atg act ttc age att att gaa 

Leu Phe Leu Gly He Leu He Thr Leu Met Thr Phe Ser He He Glu 

165 170 175 



585 



tta ttc att tct ctg cct ttc tea att ttg ggg tgc cac tea gag gat 633 

Leu Phe He Ser Leu Pro Phe Ser He Leu Gly Cys His Ser Glu Asp 

180 185 190 

tgt gat tgt gaa caa tgt tgt tga ctagcactgt gagaataaag atgtgttaaa 687 
Cys Asp Cys Glu Gin Cys Cys 
195 200 



ataaaaa 

<210> 6 

<211> 200 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Asp Ser Ser Thr Ala His Ser Pro Val Phe Leu Val Phe Pro Pro 
15 10 15 



Glu He Thr Ala Ser Glu Tyr Glu Ser Thr Glu Leu Ser Ala Thr Thr 
20 25 30 



Phe Ser Thr Gin Ser Pro Leu Gin Lys Leu Phe Ala Arg Lys Met Lys 
35 40 45 



He Leu Gly Thr He Gin He Leu Phe Gly He Met Thr Phe Ser Phe 
50 55 60 



Gly Val He Phe Leu Phe Thr Leu Leu Lys Pro Tyr Pro Arg Phe Pro 
65 70 75 B0 



Phe He Phe Leu Ser Gly Tyr Pro Phe Trp Gly Ser Val Leu Phe He 
85 90 95 



Asn Ser Gly Ala Phe Leu He Ala Val Lys Arg Lys Thr Thr Glu Thr 
100 105 110 



694 



Leu He He Leu Ser Arg He Met Asn Phe Leu Ser Ala Leu Gly Ala 
115 120 125 
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lie Ala Gly He He Leu Leu Thr Phe Gly Phe He Leu Asp Gin Asn 
130 135 140 

Tyr He Cys Gly Tyr Ser His Gin Asn Ser Gin Cys Lys Ala Val Thr 
145 150 155 160 

Val Leu Phe Leu Gly He Leu He Thr Leu Met Thr Phe Ser He He 
165 170 175 

Glu Leu Phe He Ser Leu Pro Phe Ser He Leu Gly Cys His Ser Glu 
180 185 190 

Asp Cys Asp Cys Glu Gin Cys Cys 
195 200 

<210> 7 

<211> 1152 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> cDNA 

<222> (1) . . (1152) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> CDS 

<222> (239) . . (985) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<400> 7 

ttttactgac cttgctagaa gtttacagnc acggaagtgc aggaacattt cacaaatcta 60 

caatctgtga gtatcacatc ctgtatagct gtaaacactg gaataaggaa gggctgatga 120 

ctttcagaag atgaaggtaa gtagaaaccg ttgatgggac tgagaaacca gagttaaaac 180 

ctctttggag cttctgagga ctcagctgga accaacgggc acagttggca acaccatc 238 

atg aca tea caa cct gtt ccc aat gag acc ate ata gtg etc cca tea 286 
Met Thr Ser Gin Pro Val Pro Asn Glu Thr He He Val Leu Pro Ser 
15 10 15 

aat gtc ate aac ttc tec caa gca gag aaa ccc gaa ccc acc aac cag 334 
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Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 
20 25 30 

999 cag gat age ctg aag aaa cat eta cac gca gaa att aaa gtt att 

Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu He Lys Val He 
35 40 45 



att ttg gca tct get tec ttc tct cca aat ttt ace caa gtg act tct 

He Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin Val Thr Ser 

65 70 75 80 

aca ctg ttg aac tct get tac cca ttc ata gga ccc ttt ttt ttt ate 

Thr Leu Leu Asn Ser Ala Tyr Pro Phe He Gly Pro Phe Phe Phe He 

85 90 95 



ctg gtg ggt ttc att ate ctg tct gtc aaa cag gec ace tta aat cct 
Leu Val Gly Phe He He Leu Ser Val Lys Gin Ala Thr Leu Asn Pro 
130 135 140 



145 



tat gtt tct tac ttt tat cat gat tea ctt tat ace acg gac tgc tat 
Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Cys Tyr 
165 170 175 



agt tac att ggt aat tct ggc atg tec tea aaa atg act cat gac tgt 

Ser Tyr He Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 

225 230 235 240 

gga tat gaa gaa eta ttg act tct taa gaaaaaaggg agaaatatta 

Gly Tyr Glu Glu Leu Leu Thr Ser 
245 



382 



ggg act ate cag ate ttg tgt ggc atg atg gta ttg age ttg ggg ate 430 
Gly Thr He Gin He Leu Cys Gly Met Met Val Leu Ser Leu Gly He 
50 55 60 



478 



526 



ate tct ggc tct eta tea ate gee aca gag aaa agg tta ace aag ctt 574 

He Ser Gly Ser Leu Ser lie Ala Thr Glu Lys Arg Leu Thr Lys Leu 
100 105 110 

ttg gtg cat age age ctg gtt gga age att ctg agt get ctg tct gee 622 

Leu Val His Ser Ser Leu Val Gly Ser He Leu Ser Ala Leu Ser Ala 
115 120 125 



670 



gee tea ctg cag tgt gag ttg gac aaa aat aat ata cca aca aga agt 718 
Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn lie Pro Thr Arg Ser 

150 155 160 



766 



aca gee aaa gec agt ctg get gga act etc tct ctg atg ctg att tgc 814 
Thr Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met Leu lie Cys 
180 185 190 

act ctg ctg gaa ttc tgc eta get gtg etc act get gtg ctg egg tgg 862 
Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Arg Trp 
195 200 205 

aaa cag get tac tct gac ttc cct ggg agt gta ctt ttc ctg cct cac 910 
Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu Phe Leu Pro His 
210 215 220 



958 



1005 



atcagaaagt tgattcttat gataatatgg aaaagttaac cattatagaa aagcaaagct 1065 
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tgagtttcct aaatgtaagc ttttaaagta atgaacatta aaaaaaacca ttatttcact 1125 
gtcaaaaaaa aaaaaaaaaa aaaaaaa 1152 

<210> 8 

<211> 248 

<212> PRT 

<213> Homo sapiens 



<400> 8 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr lie lie Val Leu Pro Ser 
1.5 10 15 



Asn Val He Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 
20 25 30 



Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu He Lys Val He 
35 40 45 



Gly Thr He Gin He Leu Cys Gly Met Met Val Leu Ser Leu Gly He 
50 55 60 



lie Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin Val Thr Ser 
65 70 75 80 



Thr Leu Leu Asn Ser Ala Tyr Pro Phe He Gly Pro Phe Phe Phe He 
85 90 95 



He Ser Gly Ser Leu Ser He Ala Thr Glu Lys Arg Leu Thr Lys Leu 
100 105 110 



Leu Val His Ser Ser Leu Val Gly Ser lie Leu Ser Ala Leu Ser Ala 
115 120 125 



Leu Val Gly Phe lie lie Leu Ser Val Lys Gin Ala Thr Leu Asn Pro 
130 135 140 



Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn lie Pro Thr Arg Ser 
145 150 155 160 



Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Cys Tyr 
165 170 175 
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Thr Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met Leu He Cys 
180 185 190 



Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Arg Trp 
195 200 205 



Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu Phe Leu Pro His 
210 215 220 



Ser Tyr He Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 
225 230 235 240 



Gly Tyr Glu Glu Leu Leu Thr Ser 
245 



<210> 9 

<211> 930 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDNA 

<222> (1)..(930) 

<220> 

<221> CDS 

<222> (17).. (763) 



<400> 9 

agttggcaac accatc atg aca tea caa cct gtt ccc aat gag acc ate ata 52 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr He He 
15 10 



gtg etc cca tea aat gtc ate aac ttc tec caa gca gag aaa ccc gaa 
Val Leu Pro Ser Asn Val He Asn Phe Ser Gin Ala Glu Lys Pro Glu 
15 20 25 



100 



ccc acc aac cag ggg cag gat age ctg aag aaa cat eta cac gca gaa 14 8 

Pro Thr Asn Gin Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu 

30 35 40 

ate aaa gtt att ggg act ate cag ate ttg tgt ggc atg atg gta ttg 196 

He Lys Val He Gly Thr He Gin He Leu Cys Gly Met Met Val Leu 
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45 50 55 60 

age ttg ggg ate att ttg gca tct get tec ttc tct cca aat ttt ace 244 
Ser Leu Gly lie lie Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr 
65 70 75 

caa gtg act tct aca ctg ttg aac tct get tac cca ttc ata gga ccc 292 
Gin Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr Pro Phe lie Gly Pro 
80 85 90 

ttt ttt ttt ate ate tct ggc tct eta tea ate gee aca gag aaa agg 340 
Phe Phe Phe lie lie Ser Gly Ser Leu Ser lie Ala Thr Glu Lys Arg 
95 100 105 

tta ace aag ctt ttg gtg cat age age ctg gtt gga age att ctg agt 3 88 

Leu Thr Lys Leu Leu Val His Ser Ser Leu Val Gly Ser lie Leu Ser 
110 115 120 



get ctg tct gee ctg gtg ggt ttc att 
Ala Leu Ser Ala Leu Val Gly Phe He 
125 130 

ace tta aat cct gee tea ctg cag tgt 
Thr Leu Asn Pro Ala Ser Leu Gin Cys 
145 

cca aca aga agt tat gtt tct tac ttt 
Pro Thr Arg Ser Tyr Val Ser Tyr Phe 
160 165 



ate ctg tct gtc aaa cag gee 436 
He Leu Ser Val Lys Gin Ala 
135 140 

gag ttg gac aaa aat aat ata 4 84 

Glu Leu Asp Lys Asn Asn He 
150 155 

tat cat gat tea ctt tat ace 532 
Tyr His Asp Ser Leu Tyr Thr 
170 



acg gac tgc tat aca gec aaa gee agt ctg get gga tct etc tct ctg 580 
Thr Asp Cys Tyr Thr Ala Lys Ala Ser Leu Ala Gly Ser Leu Ser Leu 
175 180 185 

atg ctg att tgc act ctg ctg gaa ttc tgc eta get gtg etc act get 628 
Met Leu He Cys Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala 
190 195 200 

gtg ctg egg tgg aaa cag get tac tct gac ttc cct ggg agt gta ctt 676 
Val Leu Arg Trp Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu 
205 210 215 220 

ttc ctg cct cac agt tac att ggt aat tct ggc atg tec tea aaa atg 724 
Phe Leu Pro His Ser Tyr He Gly Asn Ser Gly Met Ser Ser Lys Met 
225 230 235 

act cat gac tgt gga tat gaa gaa eta ttg act tct taa gaaaaaaggg 773 
Thr His Asp Cys Gly Tyr Glu Glu Leu Leu Thr Ser 
240 245 

agaaatatta atcagaaagt tgattcttat gataatatgg aaaagttaac cattatagaa 833 

aagcaaagct tgagtttcct aaatgtaagc ttttaaagta atgaacatta aaaaaaacca 893 

ttatttcact gtcaaaaaaa aaaaaaaaaa aaaaaaa 930 



<210> 10 
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<211> 248 
<212> PRT 
<213> Homo sapiens 

<400> 10 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr He He Val Leu Pro Ser 
15 10 15 



Asn Val He Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 
20 25 30 



Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu He Lys Val He 
35 40 45 



Gly Thr He Gin He Leu Cys Gly Met Met Val Leu Ser Leu Gly He 
50 55 60 



He Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin Val Thr Ser 
65 70 75 80 



Thr Leu Leu Asn Ser Ala Tyr Pro Phe He Gly Pro Phe Phe Phe He 
85 90 95 



He Ser Gly Ser Leu Ser He Ala Thr Glu Lys Arg Leu Thr Lys Leu 
100 105 110 



Leu Val His Ser Ser Leu Val Gly Ser He Leu Ser Ala Leu Ser Ala 
115 120 125 



Leu Val Gly Phe He lie Leu Ser Val Lys Gin Ala Thr Leu Asn Pro 
130 135 140 



Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn He Pro Thr Arg Ser 
145 150 155 160 



Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Cys Tyr 
165 170 175 



Thr Ala Lys Ala Ser Leu Ala Gly Ser Leu Ser Leu Met Leu He Cys 
180 185 190 



Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Arg Trp 
195 200 205 
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Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu Phe Leu Pro His 
210 215 220 

Ser Tyr He Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 
225 230 235 240 

Gly Tyr Glu Glu Leu Leu Thr .Ser 
245 

<210> 11 

<211> 661 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (15).. (458) 



<400> 11 

ttggcaacac catt atg aca tea caa cct att tec aat gag ace ate ata 50 

Met Thr Ser Gin Pro He Ser Asn Glu Thr He He 

15 10 

atg etc cca tea aat gtc ate aac ttc tec caa gca gag aaa ccc gaa 98 

Met Leu Pro Ser Asn Val He Asn Phe Ser Gin Ala Glu Lys Pro Glu 
15 20 25 

ccc ace aac cag ggg cag gat age ctg aag aaa cgt eta cag gca aaa 146 

Pro Thr Asn Gin Gly Gin Asp Ser Leu Lys Lys Arg Leu Gin Ala Lys 
30 35 40 



gtc aaa gtt att ggg gtg cat age age ctg get gga age att ctg agt 194 
Val Lys Val He Gly Val His Ser Ser Leu Ala Gly Ser He Leu Ser 
45 50 55 60 

get ctg tct gee ctg gtg ggt ttc att etc ctg tct gtc aac ccg get 242 
Ala Leu Ser Ala Leu Val Gly Phe He Leu Leu Ser Val Asn Pro Ala 
65 70 75 

gca tta aat cct gee tea ttg cag tgt aag ttg gac gaa aag gat ata 290 
Ala Leu Asn Pro Ala Ser Leu Gin Cys Lys Leu Asp Glu Lys Asp He 
80 85 90 

cca ace aga ctt ctt ctt tct tat gat tat cat tea cct tac acc atg 338 
Pro Thr Arg Leu Leu Leu Ser Tyr Asp Tyr His Ser Pro Tyr Thr Met 
95 100 105 



-15- 



WO 02/062946 



PCTAJS01/48437 



gac tgc cat aga gcc aaa gcc agt ctg get gga act ctg tct ctg atg 
Asp Cys His Arg Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met 
110 H5 120 



386 



ctg gtt tct act gtg ttg gag ttc tgc eta get gtg etc act get gtg 434 
Leu Val Ser Thr Val Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val 
125 130 135 140 

ctg cag tgg aaa cag act gtc tga cttccctggt tactcatgtc ctcaaagacg 488 
Leu Gin Trp Lys Gin Thr Val 
145 

actcatgatg ctggatatga agaactattg acttcttggg aaaaaacgga gaaatattaa 548 

ttggaaagtt gatttttatg ataatatgga aaacctaacc attataaaaa agcaaacttg 608 

agtttcctaa atgtaagcat ttaaagtaaa tgcatatttg ttttaaaaaa tta 661 

<210> 12 

<211> 147 

<212> PRT 

<213> Homo sapiens 



<400> 12 

Met Thr Ser Gin Pro He Ser Asn Glu Thr He He Met Leu Pro Ser 
15 10 15 



Asn Val He Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 
20 25 30 



Gly Gin Asp Ser Leu Lys Lys Arg Leu Gin Ala Lys Val Lys Val He 
35 40 45 



Gly Val His Ser Ser Leu Ala Gly Ser He Leu Ser Ala Leu Ser Ala 
50 55 60 



Leu Val Gly Phe He Leu Leu Ser Val Asn Pro Ala Ala Leu Asn Pro 
65 ' 70 75 80 



Ala Ser Leu Gin Cys Lys Leu Asp Glu Lys Asp He Pro Thr Arg Leu 
85 90 95 



Leu Leu Ser Tyr Asp Tyr His Ser Pro Tyr Thr Met Asp Cys His Arg 
100 105 HO 



Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met Leu Val Ser Thr 
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115 120 125 



Val Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Gin Trp Lys 
130 135 140 



Gin Thr Val 
145 



<210> 13 

<211> 1228 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDNA 

<222> (1) . . (1228) 

<220> 

<221> CDS 

<222> (118) . . (840) 



<400> 13 

agattgttct ggcaaggaac agccagtggg aggttccagc tgagcgctcc ccagaggtga 60 

gctgatcccc agccacagca cacaggacca ggctgcgaga acagcatcat cagcatc 117 

atg eta tta caa tec caa ace atg ggg gtt tct cac age ttt aca cca 165 
Met Leu Leu Gin Ser Gin Thr Met Gly Val Ser His Ser Phe Thr Pro 
15 10 15 

aag ggc ate act ate cct caa aga gag aaa cct gga cac atg tac caa 213 
Lys Gly lie Thr lie Pro Gin Arg Glu Lys Pro Gly His Met Tyr Gin 
20 25 30 

aac gaa gat tac ctg cag aac ggg ctg cca aca gaa ace ace gtt ctt 261 
Asn Glu Asp Tyr Leu Gin Asn Gly Leu Pro Thr Glu Thr Thr Val Leu 
35 40 45 

ggg act gtc cag ate ctg tgt tgc ctg ttg att tea agt ctg ggg gec . 309 
Gly Thr Val Gin lie Leu Cys Cys Leu Leu lie Ser Ser Leu Gly Ala 
50 55 60 

ate ttg gtt ttt get ccc tac ccc tec cac ttc aat cca gca att tec 357 
lie Leu Val Phe Ala Pro Tyr Pro Ser His Phe Asn Pro Ala lie Ser 
65 70 75 80 
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acc act ttg atg tct ggg tac cca ttt tta gga get ctg tgt ttt ggc 405 
Thr Thr Leu Met Ser Gly Tyr Pro Phe Leu Gly Ala Leu Cys Phe Gly 
85 90 95 

att act gga tec etc tea att ate tct gga aaa caa tea act aag ccc 453 
He Thr Gly Ser Leu Ser He He Ser Gly Lys Gin Ser Thr Lys Pro 
100 105 HO 



ttt gac ctg age age ttg acc tea aat gca gtg agt tct gtt act gca 
Phe Asp Leu Ser Ser Leu Thr Ser Asn Ala Val Ser Ser Val Thr Ala 
115 120 125 



gcc tct caa cat tgt ggc tea gaa atg gat tat eta tec tea ttg cet 
Ala Ser Gin His Cys Gly Ser Glu Met Asp Tyr Leu Ser Ser Leu Pro 
145 150 155 160 

tat teg gag tac tat tat cca ata tat gaa ate aaa gat tgt etc ctg 
Tyr Ser Glu Tyr Tyr Tyr Pro He Tyr Glu He Lys Asp Cys Leu Leu 
165 170 175 



501 



gga gca ggc etc ttc etc ctt get gac age atg gta gcc ctg agg act 54 9 

Gly Ala Gly Leu Phe Leu Leu Ala Asp Ser Met Val Ala Leu Arg Thr 
130 135 140 



597 



645 



acc agt gtc agt tta aca ggt gtc eta gtg gtg atg etc ate ttc act 693 
Thr Ser Val Ser Leu Thr Gly Val Leu Val Val Met Leu He Phe Thr 
180 185 190 

gtg ctg gag etc tta tta get gca tac agt tct gtc ttt tgg tgg aaa 741 
Val Leu Glu Leu Leu Leu Ala Ala Tyr Ser Ser Val Phe Trp Trp Lys 
195 200 205 

cag etc tac tec aac aac cet ggg agt tea ttt tec teg acc cag tea 789 
Gin Leu Tyr Ser Asn Asn Pro Gly Ser Ser Phe Ser Ser Thr Gin Ser 
210 215 220 

caa gat cat ate caa cag gtc aaa aag agt tct tea egg tct tgg ata 837 
Gin Asp His He Gin Gin Val Lys Lys Ser Ser Ser Arg Ser Trp He 
225 230 235 240 

taa gtaactcttg gectcagagg aaggaaaagc aactcaacac tcatggtcaa 890 

gtgtgattag actttcctga aatctctgcc attttagata ctgtgaaaca aactaaaaaa 950 

aaaaaagctt ttgttttgta tttgtttact atgagtcgtt atttaatttc tcttgaaaat 1010 

aatttcctca aagcccaagt caataaatgt tatcagecag tcttccaaaa tggtcataaa 1070 

ctttataaac tgctttgggt aaactgagca gaaggtgata cacagaaggg aaaatgtgca 1130 

etcatgetag tgtgaatttg gtaagtcgcg tgactctgea ggctgtttct gtattatttt 1190 

cacactcata ttgcttaaat attacatatt agggattg 1228 



<210> 14 
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<211> 240 
<212> PRT 
<213> Homo sapiens 

<400> 14 

Met Leu Leu Gin Ser Gin Thr Met Gly Val Ser His Ser Phe Thr Pro 
15 10 15 



Lys Gly lie Thr lie Pro Gin Arg Glu Lys Pro Gly His Met Tyr Gin 
20 25 30 



Asn Glu Asp Tyr Leu Gin Asn Gly Leu Pro Thr Glu Thr Thr Val Leu 
35. 40 45 



Gly Thr Val Gin lie Leu Cys Cys Leu Leu lie Ser Ser Leu Gly Ala 
50 55 60 



He Leu Val Phe Ala Pro Tyr Pro Ser His Phe Asn Pro Ala He Ser 
65 70 75 80 



Thr Thr Leu Met Ser Gly Tyr Pro Phe Leu Gly Ala Leu Cys Phe Gly 
85 90 95 



He Thr Gly Ser Leu Ser lie He Ser Gly Lys Gin Ser Thr Lys Pro 
100 105 110 



Phe Asp Leu Ser Ser Leu Thr Ser Asn Ala Val Ser Ser Val Thr Ala 
115 120 125 



Gly Ala Gly Leu Phe Leu Leu Ala Asp Ser Met Val Ala Leu Arg Thr 
130 135 140 



Ala Ser Gin His Cys Gly Ser Glu Met Asp Tyr Leu Ser Ser Leu Pro 
145 150 155 160 



Tyr Ser Glu Tyr Tyr Tyr Pro He Tyr Glu He Lys Asp Cys Leu Leu 
165 170 175 



Thr Ser Val Ser Leu Thr Gly Val Leu Val Val Met Leu He Phe Thr 
180 185 ' 190 



Val Leu Glu Leu Leu Leu Ala Ala Tyr Ser Ser Val Phe Trp Trp Lys 
195 200 205 
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Gin Leu Tyr Ser Asn Asn Pro Gly Ser Ser Phe Ser Ser Thr Gin Ser 
210 215 220 



Gin Asp His lie Gin Gin Val Lys Lys Ser Ser Ser Arg Ser Trp lie 
225 230 235 240 



<210> 15 

<211> 1369 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> cDNA 

<222> (1) . . (1369) 

<220> 

<221> CDS 

<222> (204) . . (956) 



<400> 15 

aaacaggaaa taaatacgaa tgaaactgag ctctaagcag catgtaacct ggcctgcatc 60 

caggaaatag aggacttcgg atccttctaa ccctaccacc caactggccc cagtacattc 120 

attctctcag gaaaaaaaac aaggtcccca cagcaaagaa aaggaatagg atcaagagat 180 

acgtggctgc tggcagagca age atg aat teg atg act tea gca gtt ccg gtg 233 

Met Asn Ser Met Thr Ser Ala Val Pro Val 
15 10 

gec aat tct gtg ttg gtg gtg gca ccc cac aat ggt tat cct gtg acc 281 
Ala Asn Ser Val Leu Val Val Ala Pro His Asn Gly Tyr Pro Val Thr 
15 20 25 

cca gga att atg tct cac gtg ccc ctg tat cca aac age cag ccg caa 329 
Pro Gly lie Met Ser His Val Pro Leu Tyr Pro Asn Ser Gin Pro Gin 
30 35 40 

gtc cac eta gtt cct ggg aac cca cct agt ttg gtg teg aat gtg aat 377 
Val His Leu Val Pro Gly Asn Pro Pro Ser Leu Val Ser Asn Val Asn 
45 50 55 

ggg cag cct gtg cag aaa get ctg aaa gaa ggc aaa acc ttg ggg gee 425 
Gly Gin Pro Val Gin Lys Ala Leu Lys Glu Gly Lys Thr Leu Gly Ala 
60 65 70 
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ate cag ate ate att ggc ctg get cac ate ggc etc ggc tec ate atg 473 
lie Gin lie He He Gly Leu Ala His lie Gly Leu Gly Ser He Met 
75 80 85 90 

gcg acg gtt etc gta ggg gaa tac ctg tct att tea ttc tac gga ggc 521 
Ala Thr Val Leu Val Gly Glu Tyr Leu Ser He Ser Phe Tyr Gly Gly 
95 100 105 

ttt ccc ttc tgg gga ggc ttg tgg ttt ate att tea gga tct etc tec 569 
Phe Pro Phe Trp Gly Gly Leu Trp Phe He He Ser Gly Ser Leu Ser 
110 115 120 

gtg gca gca gaa aat cag cca tat tct tat tgc ctg ctg tct ggc agt 617 
Val Ala Ala Glu Asn Gin Pro Tyr Ser Tyr Cys Leu Leu Ser Gly Ser 
125 130 135 

ttg ggc ttg aac ate gtc agt gca ate tgc tct gca gtt gga gtc ata 665 
Leu Gly Leu Asn He Val Ser Ala He Cys Ser Ala Val Gly Val He 
140 145 150 



etc ttc ate aca gat eta agt att ccc cac cca tat gee tac ccc gac 713 
Leu Phe He Thr Asp Leu Ser lie Pro His Pro Tyr Ala Tyr Pro Asp 
155 160 165 170 

tat tat cct tac gee tgg ggt gtg aac cct gga atg gcg att tct ggc 761 
Tyr Tyr Pro Tyr Ala Trp Gly Val Asn Pro Gly Met Ala He Ser Gly 
175 180 185 

gtg ctg ctg gtc ttc tgc etc ctg gag ttt ggc ate gca tgc gca tct 809 
Val Leu Leu Val Phe Cys Leu Leu Glu Phe Gly He Ala Cys Ala Ser 
190 195 200 

tec cac ttt ggc tgc cag ttg gtc tgc tgt caa tea age aat gtg agt 857 
Ser His Phe Gly Cys Gin Leu Val Cys Cys Gin Ser Ser Asn Val Ser 
205 210 215 

gtc ate tat cca aac ate tat gca gca aac cca gtg ate acc cca gaa 905 
Val He Tyr Pro Asn He Tyr Ala Ala Asn Pro Val He Thr Pro Glu 
220 225 230 

ccg gtg acc tea cca cca agt tat tec agt gag ate caa gca aat aag 953 
Pro Val Thr Ser Pro Pro Ser Tyr Ser Ser Glu He Gin Ala Asn Lys 
235 240 245 250 

taa ggctacagat tctggaagca tctttcactg ggaccaaaag aagtcctcct 1006 

ccctttctgg gcttccataa cccaggtcgt tcctgttctg acagctgagg aaaegtctet 1066 

cccactgttt gtactctcac cttcattctt caattcagtc taggaaacca tgctgtttct. 1126 

ctatcaagaa gaagacagag attttaaaca gatgttaacc aagagggact ccctagggca 1186 

catgeatcag cacatatgtg ggcatccagc ctctggggcc ttggcacaca cacattegtg 1246 

tgctctgctg catgtgagct tgtgggttag aggaacaaat atctagacat tcaatcttca 1306 

ctctttcaat tgtgcattca tttaataaat agatactgag cattcaaaaa aaaaaaaaaa 1366 

aaa 1369 
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<210> 16 

<211> 250 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Asn Ser Met Thr Ser Ala Val Pro Val Ala Asn Ser Val Leu Val 
15 10 15 



Val Ala Pro His Asn Gly Tyr Pro Val Thr Pro Gly lie Met Ser His 
20 25 30 



Val Pro Leu Tyr Pro Asn Ser Gin Pro Gin Val His Leu Val Pro Gly 
35 40 45 



Asn Pro Pro Ser Leu Val Ser Asn Val Asn Gly Gin Pro Val Gin Lys 
50 55 60 



Ala Leu Lys Glu Gly Lys Thr Leu Gly Ala He Gin He He He Gly 
65 70 75 80 



Leu Ala His He Gly Leu Gly Ser He Met Ala Thr Val Leu Val Gly 
85 90 95 



Glu Tyr Leu Ser He Ser Phe Tyr Gly Gly Phe Pro Phe Trp Gly Gly 
100 105 HO 



Leu Trp Phe He He Ser Gly Ser Leu Ser Val Ala Ala Glu Asn Gin 
115 120 125 



Pro Tyr Ser Tyr Cys Leu Leu Ser Gly Ser Leu Gly Leu Asn He Val 
130 135 140 



Ser Ala He Cys Ser Ala Val Gly Val He Leu Phe He Thr Asp Leu 
145 150 155 160 



Ser He Pro His Pro Tyr Ala Tyr Pro Asp Tyr Tyr Pro Tyr Ala Trp 
165 170 175 



Gly *Val Asn Pro Gly Met Ala He Ser Gly Val Leu Leu Val Phe Cys 
180 185 190 
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Leu Leu Glu Phe Gly lie Ala Cys Ala Ser Ser His Phe Gly Cys Gin 
195 200 205 



Leu Val Cys Cys Gin Ser Ser Asn Val Ser Val lie Tyr Pro Asn He 
210 215 220 



Tyr Ala Ala Asn Pro Val He Thr Pro Glu Pro Val Thr Ser Pro Pro 
225 230 235 240 



Ser Tyr Ser Ser Glu He Gin Ala Asn Lys 
245 250 



<210> 17 

<211> 726 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (726) 



<400> 17 

atg aaa gca gaa gcc aca gtt att ccc age cgt tgt get agg ggg etc 48 
Met Lys Ala Glu Ala Thr Val He Pro Ser Arg Cys Ala Arg Gly Leu 
15 10 15 

cca tea tgg caa gtc etc age cca gtc cag ccc tgg cag aca agt gca 96 
Pro Ser Trp Gin Val Leu Ser Pro Val Gin Pro Trp Gin Thr Ser Ala 
20 25 ~ 30 

ccc cag aac acg acc cag ccc aag etc ctg get cca cac cag cac gag 144 
Pro Gin Asn Thr Thr Gin Pro Lys Leu Leu Ala Pro His Gin His Glu 
35 40 45 

aag tec cag aag aag age age ctt ctt aag gag ctg ggg gcc ttc cac 192 
Lys Ser Gin Lys Lys Ser Ser Leu Leu Lys Glu Leu Gly Ala Phe His 
50 55 60 

ate acc ate get ctg ctg cac ctg gtc ttt ggg ggc tac ctg gcc tct 240 
He Thr He Ala Leu Leu His Leu Val Phe Gly Gly Tyr Leu Ala Ser 
65 70 75 80 

ata gtc aag aac ctt cac ctg gtg gtg ctg aag tct tgg tat cca ttc 288 
He Val Lys Asn Leu His Leu Val Val Leu Lys Ser Trp Tyr Pro Phe 
85 90 95 
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tgg ggg get gec tct ttt etc att tea ggg ate ttg gcg ata aca atg 3: 
Trp Gly Ala Ala Ser Phe Leu He Ser Gly He Leu Ala He Thr Met 
100 105 110 

aag ace ttt tct aaa act tac ctg aag atg ttg tgc ctg atg aca aac 3 
Lys Thr Phe Ser Lys Thr Tyr Leu Lys Met Leu Cys Leu Met Thr Asn 
115 120 125 

etc ate age etc ttt tgc gtg ctg tct ggc etc ttc gtc ate tec aag 4 
Leu He Ser Leu Phe Cys Val Leu Ser Gly Leu Phe Val He Ser Lys 
130 135 140 

gat etc ttt ctg gag age cca ttt gag tec ccg ate tgg aga atg tac 4 
Asp Leu Phe Leu Glu Ser Pro Phe Glu Ser Pro He Trp Arg Met Tyr 
145 150 155 160 



ccc aac tec acg gtc cac ate cag agg ctg gag ctg gee ttg etc tgc 
Pro Asn Ser Thr Val His lie Gin Arg Leu Glu Leu Ala Leu Leu Cys 
165 170 175 

ttc act gtc eta gag etc ttc ctg cca gtg ccc aca get gtc aca gee 
Phe Thr Val Leu Glu Leu Phe Leu Pro Val Pro Thr Ala Val Thr Ala 
180 185 190 

tgg aga ggg gac tgc cca tct gca aag aat gat gat gca tgc ctt gtt 
Trp Arg Gly Asp Cys Pro Ser Ala Lys Asn Asp Asp Ala Cys Leu Val 
195 200 205 

ccg aat aca cca ttg cat etc aaa ggc ctg ccg gtg gag ccc ccg cca 
Pro Asn Thr Pro Leu His Leu Lys Gly Leu Pro Val Glu Pro Pro Pro 
210 215 220 

tec tac cag agt gtg att caa ggc gac gca caa cac aag caa cat cag 
Ser Tyr Gin Ser Val He Gin Gly Asp Ala Gin His Lys Gin His Gin 
225 230 235 240 

agg tga 
Arg 



<210> 18 

<211> 241 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Met Lys Ala Glu Ala Thr Val He Pro Ser Arg Cys Ala Arg Gly Leu 
15 10 15 



Pro Ser Trp Gin Val Leu Ser Pro Val Gin Pro Trp Gin Thr Ser Ala 
20 25 30 



Pro Gin Asn Thr Thr Gin Pro Lys Leu Leu Ala Pro His Gin His Glu 



5 
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35 



40 



45 



Lys Ser Gin Lys Lys Ser Ser Leu Leu Lys Glu Leu Gly Ala Phe His 
50 55 60 



lie Thr He Ala Leu Leu His Leu Val Phe Gly Gly Tyr Leu Ala Ser 
65 70 75 80 



He Val Lys Asn Leu His Leu Val Val Leu Lys Ser Trp Tyr Pro Phe 
85 90 95 



Trp Gly Ala Ala Ser Phe Leu He Ser Gly He Leu Ala He Thr Met 
100 105 110 



Lys Thr Phe Ser Lys Thr Tyr Leu Lys Met Leu Cys Leu Met Thr Asn 
115 120 125 



Leu He Ser Leu Phe Cys Val Leu Ser Gly Leu Phe Val He Ser Lys 
130 135 140 



Asp Leu Phe Leu Glu Ser Pro Phe Glu Ser Pro He Trp Arg Met Tyr 
145 150 155 160 



Pro Asn Ser Thr Val His He Gin Arg Leu Glu Leu Ala Leu Leu Cys 
165 170 175 



Phe Thr Val Leu Glu Leu Phe Leu Pro Val Pro Thr Ala Val Thr Ala 
180 185 190 



Trp Arg Gly Asp Cys Pro Ser Ala Lys Asn Asp Asp Ala Cys Leu Val 
195 200 205 



Pro Asn Thr Pro Leu His Leu Lys Gly Leu Pro Val Glu Pro Pro Pro 
210 215 220 



Ser Tyr Gin Ser Val He Gin Gly Asp Ala Gin His Lys Gin His Gin 
225 230 235 240 



Arg 



<210> 19 
<211> 518 
<212> DNA 
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<213> Mus musculus 
<220> 

<221> cDNA 

<222> (1) . . (518) 

<220> 

<221> CDS 

<222> (148) . . (516) 

<400> 19 

taagcgctga ggaggaagag actgctggtt ttgggggaca gactctggtg gtcattactg 60 

tctcctcttc tgtaatgagt tggacttgca ggggaaggac ttgtcaaatc ctgaattctt 120 

caaggtggta cctcatcagg cagacac atg aag cca gag gag act ggt ggt tct 174 

Met Lys Pro Glu Glu Thr Gly Gly Ser 
1 5 

gtt tat cag ccc ttg gat gag tea cgc cat gtt caa cga ggt gta ctg 222 
Val Tyr Gin Pro Leu Asp Glu Ser Arg His Val Gin Arg Gly Val Leu 
10 15 20 25 



caa gec etc ggg gec ate cag ate ctg aat gga ata ctg att eta get 270 

Gin Ala Leu Gly Ala lie Gin lie Leu Asn Gly lie Leu lie Leu Ala 

30 35 40 

etc gga att ttt ctg gtt tgt tta caa cac gtg tec cac cac ttc agg 318 

Leu Gly lie Phe Leu Val Cys Leu Gin His Val Ser His His Phe Arg 

45 50 55 



cat ttc ttc ttc ttc acc ttc tac aca ggc tac cca ctg tgg ggt get 366 

His Phe Phe Phe Phe Thr Phe Tyr Thr Gly Tyr Pro Leu Trp Gly Ala 
60 65 70 

gtg ttt ttt ate age tea gga tec ttg act gtt gee gca ggg aga aac 414 

Val Phe Phe He Ser Ser Gly Ser Leu Thr Val Ala Ala Gly Arg Asn 

75 80 85 

ccc aca cga atg ctg atg caa aac agt ttt ggg ata aac att gec agt 462 

Pro Thr Arg Met Leu Met Gin Asn Ser Phe Gly He Asn He Ala Ser 

90 95 100 105 



act aca att gca ttt gtt ggg act gtt ttc ctt tct gtg cat ttg gca 510 
Thr Thr He Ala Phe Val Gly Thr Val Phe Leu Ser Val His Leu Ala 
110 115 120 

ttc aat ac 518 
Phe Asn 
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<210> 20 

<211> 123 

<212> PRT 

<213> Mus musculus 

<400> 20 

Met Lys Pro Glu Glu Thr Gly Gly Ser Val Tyr Gin Pro Leu Asp Glu 
15 10 15 



Ser Arg His Val Gin Arg Gly Val Leu Gin Ala Leu Gly Ala He Gin 
20 25 30 



He Leu Asn Gly He Leu He Leu Ala Leu Gly He Phe Leu Val Cys 
35 40 45 



Leu Gin His Val Ser His His Phe Arg His Phe Phe Phe Phe Thr Phe 
50 55 60 



Tyr Thr Gly Tyr Pro Leu Trp Gly Ala Val Phe Phe He Ser Ser Gly 
65 70 75 80 



Ser Leu Thr Val Ala Ala Gly Arg Asn Pro Thr Arg Met Leu Met Gin 
85 90 95 



Asn Ser Phe Gly He Asn He Ala Ser Thr Thr He Ala Phe Val Gly 
100 105 110 



Thr Val Phe Leu Ser Val His Leu Ala Phe Asn 
115 120 



<210> 21 

<211> 1228 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDNA 
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<222> (1) . . (1228) 
<220> 

<221> CDS 

<222> (98) . . (778) 

<400> 21 

ttcggcacga ggcacacaag caaagcttct cctggaataa atcagctgac atctgatcct 60 

ttccattgtc tgtactgttt ctgctgctct ggaaacc atg caa gga cag gaa cag 115 

Met Gin Gly Gin Glu Gin 
1 5 



acc acc atg gca gtg gtt cct gga gtt get gtg cct tea aag aat tct 163 
Thr Thr Met Ala Val Val Pro Gly Val Ala Val Pro Ser Lys Asn Ser 
10 15 20 

gtt atg aca tea caa atg tgg aat gag aag aaa gag aaa ttc ttg aag 211 
Val Met Thr Ser Gin Met Trp Asn Glu Lys Lys Glu Lys Phe Leu Lys 
25 30 35 

ggg gaa ccc aaa gtc ctt ggg gtt tta caa gtg atg att get ate ata 259 
Gly Glu Pro Lys Val Leu Gly Val Leu Gin Val Met lie Ala lie He 
40 45 50 

aac etc age tta gga ata ata att ttg aca act tta ttt tct gaa eta 307 
Asn Leu Ser Leu Gly He He He Leu Thr Thr Leu Phe Ser Glu Leu 
55 60 65 70 

ccc act tea gtg atg tta atg gtc cca att tgg gga tea ata atg ttc 355 
Pro Thr Ser Val Met Leu Met Val Pro He Trp Gly Ser He Met Phe 
75 80 85 

att gtc tec gga tec ctg tec att gca gca gga gtg aca cct aca aaa 403 
He Val Ser- Gly Ser Leu Ser He Ala Ala Gly Val Thr Pro Thr Lys 
90 95 100 

tgc ctg ate gtt gee agt eta act ctg aac act ate acc tct gtg ttg 451 
Cys Leu He Val Ala Ser Leu Thr Leu Asn Thr He Thr Ser Val Leu 
105 110 115 

get gca act gca age ata atg ggt gta gtc agt gtg get gtg ggt tea 499 
Ala Ala Thr Ala Ser He Met Gly Val Val Ser Val Ala Val Gly Ser 
120 125 130 

cag ttt ccg ttt egg tat aat tat aca ate acc aag ggt ttg gat gtt 547 
Gin Phe Pro Phe Arg Tyr Asn Tyr Thr lie Thr Lys Gly Leu Asp Val 
135 140 145 150 

ttg atg tta att ttc aat atg eta gaa ttc tgc ctt get gtg tec gtc 595 
Leu Met Leu He Phe Asn Met Leu Glu Phe Cys Leu Ala Val Ser Val 
155 160 165 

tct gca ttt gga tgt gaa get tec tgt tgt aac tec cgt gag gtt ctt 643 
Ser Ala Phe Gly Cys Glu Ala Ser Cys Cys Asn Ser Arg Glu Val Leu 
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170 175 180 

gta gtg eta cca tea aat cct gtt gag aca gtg atg gca ccc cca atg 691 
Val Val Leu Pro Ser Asn Pro Val Glu Thr Val Met Ala Pro Pro Met 
185 190 195 

aca ctt caa cca ttg eta cca tea gaa cac caa ggg ace aat gtt cca 739 
Thr Leu Gin Pro Leu Leu Pro Ser Glu His Gin Gly Thr Asn Val Pro 
200 205 210 

gga aat gtg tac aag aac cac cca gga gaa ata gtc taa ttttgatgtg 788 
Gly Asn Val Tyr Lys Asn His Pro Gly Glu He Val 
215 220 225 

tgtgtgtgtg tatttcccta ggatattaac acttcattgc actggctttt gaggtgaata 848 

ttagatttac tgtaagtatg taagtcaagc acttattagg tcaacaacac ttcaacatat 908 

tatattcatt gtatgtacaa ggggcaatga atttgeaaag atgttttgaa agcaaacaga 968 

aaaaaaaaaa aaacaaccaa acaaaagacc tcttagtgaa atgaggtctc tttgeaaaga 1028 

ctaaaaaact ggagttcaca tttttggggg tggggggggc ttttgctaaa taagtagatt 1088 

tagatgettt tgatcaagta caaactcata aagtatgtaa gaaattacta atgataggaa 1148 

ccaattttaa tctcatatgt aacatagtgt ataatttaat agatctggta aaaatttata 12 08 

ataaagagaa atgcctgaaa 1228 

<210> 22 

<211> 226 

<212> PRT 

<213> Mus musculus 



<400> 22 

Met Gin Gly Gin Glu Gin Thr Thr Met Ala Val Val Pro Gly Val Ala 
15 10 15 



Val Pro Ser Lys Asn Ser Val Met Thr Ser Gin Met Trp Asn Glu Lys 
20 25 30 



Lys Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Val Leu Gin 
35 40 45 



Val Met He Ala He He Asn Leu Ser Leu Gly He He He Leu Thr 
50 55 60 



Thr Leu Phe Ser Glu Leu Pro Thr Ser Val Met Leu Met Val Pro He 
65 70 75 80 
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Trp Gly Ser He Met Phe He Val Ser Gly Ser Leu Ser He Ala Ala 
85 90 95 



Gly Val Thr Pro Thr Lys Cys Leu He Val Ala Set Leu Thr Leu Asn 
100 105 110 



Thr He Thr Ser Val Leu Ala Ala Thr Ala Ser He Met Gly Val Val 
115 120 125 



Ser Val Ala Val Gly Ser Gin Phe Pro Phe Arg Tyr Asn Tyr Thr He 
130 135 140 



Thr Lys Gly Leu Asp Val Leu Met Leu He Phe Asn Met Leu Glu Phe 
145 150 155 160 



Cys Leu Ala Val Ser Val Ser Ala Phe Gly Cys Glu Ala Ser Cys Cys 
165 170 175 



Asn Ser Arg Glu Val Leu Val Val Leu Pro Ser Asn Pro Val Glu Thr 
180 185 19.0 



Val Met Ala Pro Pro Met Thr Leu Gin Pro Leu Leu Pro Ser Glu His 
195 200 205 



Gin Gly Thr Asn Val Pro Gly Asn Val Tyr Lys Asn His Pro Gly Glu 
210 215 220 



He Val 
225 



<210> 23 

<211> 1480 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 

<222> (1)..(1480) 
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<220> 

<221> CDS 

<222> (107) . . (787) 



<400> 23 

attcggcacg atgggtgaga acacacaatc aaaacttctc ctggaatata tcatctgaca 60 

tcagatcctt cccattgtct gtactgtttc tgctgccctg gaaacc atg caa gga 115 

Met Gin Gly 
1 

cag gaa cag acc acc atg gca gtg gtt cct gga ggt get cca cct tea 163 
Gin Glu Gin Thr Thr Met Ala Val Val Pro Gly Gly Ala Pro Pro Ser 
5 10 15 

gag aat tct gtt atg aaa tea caa atg tgg aac gag aac aag gag aaa 211 
Glu Asn Ser Val Met Lys Ser Gin Met Trp Asn Glu Asn Lys Glu Lys 
20 25 30 35 

ttc ttg aag ggg gaa ccc aaa gtc ctt ggg gtt gta caa gtt atg att 259 
Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Val Val Gin Val Met lie 
40 45 50 

get etc ata aac etc age ttc gga ata ata att ttg gca aat eta tct 307 
Ala Leu lie Asn Leu Ser Phe Gly He He He Leu Ala Asn Leu Ser 
55 60 65 



tct gaa cca etc att tct gtg gtc tta atg get cca att tgg gga cca 355 

Ser Glu Pro Leu He Ser Val Val Leu Met Ala Pro He Trp Gly Pro 
70 75 80 

ata atg ttc att gtc tea gga tec ctg tea att gca gca gga gtg aga 403 

He Met Phe He Val Ser Gly Ser Leu Ser He Ala Ala Gly Val Arg 

85 90 95 

cct aca aaa aag ctg ate ate age agt eta act ctg aac act ate acc 451 

Pro Thr Lys Lys Leu lie He Ser Ser Leu Thr Leu Asn Thr He Thr 
100 105 110 115 

tct gtg ttg get gca act gca age ata atg ggt gta gtc agt gtg get 499 

Ser Val Leu Ala Ala Thr Ala Ser He Met Gly Val Val Ser Val Ala 
120 125 130 

9tg 99t tca ca 9 fctt cc 9 fctt c 99 tat aat tat aca atc acc aa 9 99 fc 547 

Val Gly Ser Gin Phe Pro Phe Arg Tyr Asn Tyr Thr lie Thr Lys Gly 

135 140 145 

ttg gat att ttg atg tta att tta aat atg eta gaa ttc tgc att get 595 

Leu Asp He Leu Met Leu He Leu Asn Met Leu Glu Phe Cys He Ala 
150 155 160 

gtg tec atc tct get ttt gga tgt aaa get tec tgt tgt aac tec age 643 

Val Ser He Ser Ala Phe Gly Cys Lys Ala Ser Cys Cys Asn Ser Ser 

165 170 175 

gag gtt ctt gta gtg eta cca tca aat cct get gtg act gtg atg gca 691 
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Glu Val Leu Val Val Leu Pro Ser Asn Pro Ala Val Thr Val Met Ala 
180 185 190 195 

ccc cct gtg acg ctt caa cca ttg cca cca tea gaa cac caa ggg aaa 73 9 

Pro Pro Val Thr Leu Gin Pro Leu Pro Pro Ser Glu His Gin Gly Lys ' 
200 205 210 

aat gtt cca gaa aat gta tat aag aac cac tea gaa gaa ata gtc taa 787 
Asn Val Pro Glu Asn Val Tyr Lys Asn His Ser Glu Glu lie Val 
215 220 225 

847 
907 
967 
1027 
1087 
1147 
1207 
1267 
1327 
1387 
1447 
1480 



<210> 24 

<211> 226 

<212> PRT 

<213> Mus musculus 

<400> 24 

Met Gin Gly Gin Glu Gin Thr Thr Met Ala Val Val Pro Gly Gly Ala 
15 10 15 



Pro Pro Ser Glu Asn Ser Val Met Lys Ser Gin Met Trp Asn Glu Asn 
20 25 30 



Lys Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Val Val Gin 
35 40 45 



Val Met He Ala Leu He Asn Leu Ser Phe Gly He He He Leu Ala 



ttctcttgtg 


tgtatgtgaa 


tgtgtgtgtg 


tgtgtgtgtg 


tgtgtgtgta 


tgcatttccc 


tgggaaatta 


acacttcatt 


gccctggctt 


ttggggggaa 


tatcagacat 


gecataaett 


tgtaaatcta 


acatctacta 


gtacaacaac 


actttaacat 


cctccattaa 


ttgtatgtac 


aaggggcaat 


gaattagcaa 


agatgttttg 


aaagcaaaat 


gaaacaaaca 


aacaagecag 


cacacaaaaa 


aagacctgtt 


ttgtcttttg 


caaagattaa 


agaaaggegt 


tcacatttct 


ttggggcttt 


tgctaaataa 


gtggatttag 


ctgcttttga 


tcatgtacaa 


acacatgaat 


tatgtatatt 


atattactaa 


tgataggaac 


caatttttat 


ctctgtgcat 


attcatcata 


taataacata 


ctctatactg 


taattgatat 


ggtaaaaatt 


ataataaaga gaagtgactg 


aaatcggaat 


acacaaacac 


acacacacac 


acacatttac 


atacatagtg 


tatatctctg 


tatatacata 


gtacgtgtat 


ggtgtgcatg 


tagagattat 


agaaggtata 


catacacata 


tgtgtacata 


ttacatatga 


cctaaaaaat 


cacttataac 


tgtggtttgg tgctgtaatc 


tcccctaagg 


ctcacatctt 


tgaacagttg 


gtc 
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50 55 60 



Asn Leu Ser Ser Glu Pro Leu lie Ser Val Val Leu Met Ala Pro He 
65 70 75 80 



Trp Gly Pro He Met Phe He Val Ser Gly Ser Leu Ser He Ala Ala 
85 90 95 



Gly Val Arg Pro Thr Lys Lys Leu He He Ser Ser Leu Thr Leu Asn 
100 105 110 



Thr He Thr Ser Val Leu Ala Ala Thr Ala Ser He Met Gly Val Val 
115 120 125 



Ser Val Ala Val Gly Ser Gin Phe Pro Phe Arg Tyr Asn Tyr Thr lie 
130 135 140 



Thr Lys Gly Leu Asp He Leu Met Leu He Leu Asn Met Leu Glu Phe 
145 150 155 160 



Cys He Ala Val Ser He Ser Ala Phe Gly Cys Lys Ala Ser Cys Cys 
165 170 175 



Asn Ser Ser Glu Val Leu Val Val Leu Pro Ser Asn Pro Ala Val Thr 
180 185 190 



Val Met Ala Pro Pro Val Thr Leu Gin Pro Leu Pro Pro Ser Glu His 
195 200 205 



Gin Gly Lys Asn Val Pro Glu Asn Val Tyr Lys Asn His Ser Glu Glu 
210 215 220 



He Val 
225 



<210> 25 

<211> 1187 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 
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<222> (1)..(1187) 
<220> 

<221> CDS 

<222> (36).. (713) 



<400> 25 

ccattgtctg tactgtttct gctgccctgg aaacc atg caa gga ctg gca cag 53 

Met Gin Gly Leu Ala Gin 
1 5 

acc acc atg gca gtg gtt cct gga ggt get cca cct tea gag aat tct 101 

Thr Thr Met Ala Val Val Pro Gly Gly Ala Pro Pro Ser Glu Asn Ser 
10 15 20 

gtt ata aaa tea caa atg tgg aac aag aac aaa gag aaa ttc ttg aag 14 9 

Val He Lys Ser Gin Met Trp Asn Lys Asn Lys Glu Lys Phe Leu Lys 

25 30 35 

ggg gaa ccc aaa gtc etc ggg get ata caa gtt atg att get ttc ata 197 

Gly Glu Pro Lys Val Leu Gly Ala He Gin Val Met He Ala Phe He 
40 45 50 

aac ttc age tta gga ata ata att ata tta aat aga gtt tct gaa cga 245 

Asn Phe Ser Leu Gly He lie lie lie Leu Asn Arg Val Ser Glu Arg 
55 60 65 70 



ttc atg tea gtg etc tta ctg gec cca ttt tgg gga tea ata atg ttc 
Phe Met Ser Val Leu Leu Leu Ala Pro Phe Trp Gly Ser lie Met Phe 
75 80 85 



gec atg ate ate age agt eta agt gtg aac act ate agt tct gtg ttg 
Ala Met lie lie Ser Ser Leu Ser Val Asn Thr lie Ser Ser Val Leu 
105 HO 115 



293 



att ttc tea gga tec ctg tea att gca gca gga gtg aaa cct aca aaa 341 
lie Phe Ser Gly Ser Leu Ser lie Ala Ala Gly Val Lys Pro Thr Lys 
90 95 100 



389 



get gtg gca gca age att att ggc gtc ate agt gtg att tct ggt gtt 437 
Ala Val Ala Ala Ser He lie Gly Val lie Ser Val lie Ser Gly Val 
120 125 130 



ttt cgc caa ttt aga agt caa cca gec ate get agt ttg gat gtt ttg 485 

Phe Arg Gin Phe Arg Ser Gin Pro Ala He Ala Ser Leu Asp Val Leu 

135 140 145 150 

atg aca att ttg aat atg eta gaa ttc tgc att get gtg tec gtc tct 533 

Met Thr lie Leu Asn Met Leu Glu Phe Cys lie Ala Val Ser Val Ser 

155 160 165 

gca ttt ggg tgt aaa get tec tgt tgt aac tec agt gag gtt ctt gta 581 

Ala Phe Gly Cys Lys Ala Ser Cys Cys Asn Ser Ser Glu Val Leu Val 

170 175 180 
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gtg eta cca tea aat tct get gtg aca gtg aca gca ccc ccc atg ata 629 

Val Leu Pro Ser Asn Ser Ala Val Thr Val Thr Ala Pro Pro Met He 
185 190 195 

ctt caa cca ttg cca cca tea gaa tgc caa ggg aaa aat gtt cca gaa 677 

Leu Gin Pro Leu Pro Pro Ser Glu Cys Gin Gly Lys Asn Val Pro Glu 
200 205 210 



aat eta tac agg aac caa cca gga gaa ata gtc taa ttttgatgta 723 
Asn Leu Tyr Arg Asn Gin Pro Gly Glu He Val 



215 


220 


225 








cgtgttttca 


tgtgtgagag 


tgtgtgtgtg 


cacatgtgta 


tgcatttggc 


tttttgtacg 


783 


aatataagac 


atgctctaaa 


taagtaagtc 


aagcatttat 


taagtcaata 


acactttaaa 


843 


atcttctatt 


cattgtatgt 


aeaagaagaa 


tttaattggc 


aaaaaaaatt 


ttaatgtaaa 


903 


etgataaaaa 


ccttggttta 


gtgaggtgaa 


gtctccaaag 


agtgaagaat 


ggagttcatg 


963 


ggtttttttt 


gggggggggt 


tgctaaataa ataaatttgg 


ctgctcacga 


tcatatacag 


1023 


acacataaac 


tatgtaagac 


agcaggtgta 


ttaccttact 


aataatagca 


accaatttta 


1083 


atcctacatg 


aatgttcatc 


ttaatagtat 


atgccttaat 


tgttatggtg 


aaaagttaca 


1143 


ataaacaaaa 


gtgattgaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaag 




1187 



<210> 26 

<211> 225 

<212> PRT 

<213> Mus musculus 

<400> 26 

Met Gin Gly Leu Ala Gin Thr Thr Met Ala Val Val Pro Gly Gly Ala 
15 10 15 

Pro Pro Ser Glu Asn Ser Val lie Lys Ser Gin Met Tip Asn Lys Asn 
20 25 30 

Lys Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Ala He Gin 
35 40 45 

Val Met He Ala Phe He Asn Phe Ser Leu Gly He He He He Leu 
50 55 60 

Asn Arg Val Ser Glu Arg Phe Met Ser Val Leu Leu Leu Ala Pro Phe 
65 70 75 80 
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Trp Gly Ser He Met Phe He Phe Ser Gly Ser Leu Ser He Ala Ala 
85 90 95 



Gly Val Lys Pro Thr Lys Ala Met lie He Ser Ser Leu Ser Val Asn 
100 105 110 



Thr He Ser Ser Val Leu Ala Val Ala Ala Ser He He Gly Val He 
115 120 125 



Ser Val He Ser Gly Val Phe Arg Gin Phe Arg Ser Gin Pro Ala He 
130 135 140 



Ala Ser Leu Asp Val Leu Met Thr He Leu Asn Met Leu Glu Phe Cys 
145 150 155 160 



He Ala Val Ser Val Ser Ala Phe Gly Cys Lys Ala Ser Cys Cys Asn 
165 170 175 



Ser Ser Glu Val Leu Val Val Leu Pro Ser Asn Ser Ala Val Thr Val 
180 185 190 



Thr Ala Pro Pro Met He Leu Gin Pro Leu Pro Pro Ser Glu Cys Gin 
195 200 205 



Gly Lys Asn Val Pro Glu Asn Leu Tyr Arg Asn Gin Pro Gly Glu He 
210 215 220 



Val 
225 



<210> 27 

<211> 1687 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 

<222> (1)..(1687) 
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<220> 

<221> CDS 

<222> (133) . . (867) 



<400> 27 

gttacaagtt cacaagaagg aaccaaggat cagcctgaga gaacccagag ttaaggctct 60 

tcgggttcct gagagctcgg ctggaagtga ctgggtgaca aggcacacag gctcagccgt 120 

gggagctcaa tc atg att cca caa gta gtg acc agt gag act gtg gca atg 171 
Met He Pro Gin Val Val Thr Ser Glu Thr Val Ala Met 
15 10 

att teg cca aat gga atg agt ctt ccc caa aca gac aaa ccc cag cct 219 
He Ser Pro Asn Gly Met Ser Leu Pro Gin Thr Asp Lys Pro Gin Pro 
15 20 25 

ttc cac cag tgg caa gac age ctg aag aaa cat eta aag get gag ate 267 
Phe His Gin Trp Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu He 
30 35 40 45 

aaa gtg atg gcg gca ate cag ate atg tgt get gtg atg gtg ttg agt 315 
Lys Val Met Ala Ala He Gin He Met Cys Ala Val Met Val Leu Ser 
50 55 60 

ctg gga ate att ttg gca tct gtt ccc tec aat eta cac ttt acc tea 363 
Leu Gly He He Leu Ala Ser Val Pro Ser Asn Leu His Phe Thr Ser 
65 70 75 

gtg ttt tea gtc ctg tta aaa tct ggc tac cca ttt ata gga get ttg 411 
Val Phe Ser Val Leu Leu Lys Ser Gly Tyr Pro Phe He Gly Ala Leu 
80 85 90 

ttt ttt ata gtc tct gga att ctg tec ate gtc acg gag aca aag tea 459 
Phe Phe He Val Ser Gly lie Leu Ser He Val Thr Glu Thr Lys Ser 
95 100 105 

aca aaa att ttg gta gac age age ctg act ctg aat ate ctg agt gtt 507 
Thr Lys He Leu Val Asp Ser Ser Leu Thr Leu Asn lie Leu Ser Val 
110 115 120 125 

tea ttt get ttc atg ggc ate att ate ate tct gtc age ctg get ggt 555 
Ser Phe Ala Phe Met Gly He lie lie lie Ser Val Ser Leu Ala Gly 
130 135 140 

ttg cat cct gec tea gag cag tgc ttg cag age aag gag ctt aga cca 603 
Leu His Pro Ala Ser Glu Gin Cys Leu Gin Ser Lys Glu Leu Arg Pro 
145 150 155 

act gaa tat cat tac tac caa ttc ttg gac agg aac gag tgc ttt gee 651 
Thr Glu Tyr His Tyr Tyr Gin Phe Leu Asp Arg Asn Glu Cys Phe Ala 
160 165 170 

gec aag tct gtt ctg get gga gtc ttt tea ctg atg ctg ate agt act 699 
Ala Lys Ser Val Leu Ala Gly Val Phe Ser Leu Met Leu He Ser Thr 
175 180 185 
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atg ttg gaa ctt ggc ctg get gtc etc act gee atg ctg tgg tgg aaa 747 
Met Leu Glu Leu Gly Leu Ala Val Leu Thr Ala Met Leu Trp Trp Lys 
190 195 200 205 

5 cag agt cac tct aac ate cct ggg aat gtt atg ttc ctg cca cat age 795 
Gin Ser His Ser Asn lie Pro Gly Asn Val Met Phe .Leu Pro His Ser 
210 215 220 

tea aat aat gac tec aac atg gaa tea aag gta ctt tgt aac ccc tea 843 
10 Ser Asn Asn Asp Ser Asn Met Glu Ser Lys Val Leu Cys Asn Pro Ser 
225 230 235 

tat gag gaa caa ttg gtt tgt taa gaaaaacaaa acaaaacaaa actaaatacc 897 
Tyr Glu Glu Gin Leu Val Cys 
15 240 

acaaaaacaa atggaactat acegcagaag atatgtcttc atgataatgc agaaattcca 957 
accatcacag ggtagcaatg ettgetaett aaaatgtaga ctgttcatac agtgggtacc 1017 

20 

agtatgagtt gaatgtgtgt attactggca ccctattgat tttcatgacc ttggcttcag 1077 
ccaaagccca gacctacaaa tggtggcctt tcttagaaaa ccaaacagaa tgtttcaggc 113 7 
25 cattgtagtg gggaaaaggg acaacatttt cttgccccct gcaattaaca gcaacagtta 1197 
accattagca gtctttgtat tcagaatctc tgctatgacg tgggtcttct atcatatcag 1257 
ttttattcac tggtgtgaat aaacaaggga ectgeaatga agtctgaaga agtttccatg 1317 

30 

ttgttggttc aaaataaagt ctttttgtga ttgtatattc tttttgtatg gggttttgtg 1377 
tgggttttgt tgttgttgtt gttttgcttt tatgeacaga tcaagtccta agtggtacat 1437 
35 aaacacatgt gtttgetatt ttttttttgt tctctatcaa ccaaaaaaaa aaaaaaaaaa 1497 
ggcatgggaa agagaaaaag gaatatatct gtgtctctgt gtttatgtgg tgtgtgtgtg 1557 
tgtgtgtttt aaactaaaac tgaatgaaca ttcaaagttt agcaatgtat tttggaggta 1617 

40 

ccagataaca tacttatttc tagaatcgee agagcaaatg agagacattt tgtegcttet 1677 
tataagaggc 1687 

45 

<210> 28 
<211> 244 
50 <212> PRT 

<213> Mus musculus 



55 



60 



<400> 28 

Met He Pro Gin Val Val Thr Ser Glu Thr Val Ala Met He Ser Pro 
15 10 15 
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Asn Gly Met Ser Leu Pro Gin Thr Asp Lys Pro Gin Pro Phe His Gin 
20 25 30 



Trp Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu lie Lys Val Met 
35 40 45 



Ala Ala He Gin He Met Cys Ala Val Met Val Leu Ser Leu Gly He 
50 55 60 



He Leu Ala Ser Val Pro Ser Asn Leu His Phe Thr Ser Val Phe Ser 
65 70 75 80 



Val Leu Leu Lys Ser Gly Tyr Pro Phe He Gly Ala Leu Phe Phe He 
85 90 95 



Val Ser Gly He Leu Ser He Val Thr Glu Thr Lys Ser Thr Lys He 
100 105 110 



Leu Val Asp Ser Ser Leu Thr Leu Asn He Leu Ser Val Ser Phe Ala 
115 120 125 



Phe Met Gly He He He He Ser Val Ser Leu Ala Gly Leu His Pro 
130 135 140 



Ala Ser Glu Gin Cys Leu Gin Ser Lys Glu Leu Arg Pro Thr Glu Tyr 
145 150 155 160 



His Tyr Tyr Gin Phe Leu Asp Arg Asn Glu Cys Phe Ala Ala Lys Ser 
165 ~ 170 175 



Val Leu Ala Gly Val Phe Ser Leu Met Leu lie Ser Thr Met Leu Glu 
180 185 190 



Leu Gly Leu Ala Val Leu Thr Ala Met Leu Trp Trp Lys Gin Ser His 
195 200 205 



Ser Asn He Pro Gly Asn Val Met Phe Leu Pro His Ser Ser Asn Asn 
210 215 220 



Asp Ser Asn Met Glu Ser Lys Val Leu Cys Asn Pro Ser Tyr Glu Glu 
225 230 235 240 



Gin Leu Val Cys 
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<210> 29 

<211> 899 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 
<222> (1)..(899) 

<220> 

<221> CDS 

<222> (164) . . (817) 

<400> 29 

aattcggcac taggaggaag ggctggagac agttacaagt tcacaagaag gaaccaagga 60 

tcagcctgag agaacccaga gttaaggctc ttagggtttc tgagagctcg gctggaagtg 120 

actgggtgac aaggcacaca ggctcagctg tggcagctcc ate atg att cca cag 175 



Met He Pro Gin 
1 



gta gtg acc aat gag acc ate aca acg att tea cca aat gga ate aac 
Val Val Thr Asn Glu Thr He Thr Thr He Ser Pro Asn Gly He Asn 
5 10 15 20 



223 



ttt ccc caa aaa gac gag tec cag cct acc caa cag agg caa gac age 
Phe Pro Gin Lys Asp Glu Ser Gin Pro Thr Gin Gin Arg Gin Asp Ser 
25 30 35 



271 



ctg aag aaa cat eta aag get gag ate aaa gtg ata gtg gca ate cag 
Leu Lys Lys His Leu Lys Ala Glu lie Lys Val He Val Ala He Gin 
40 45 50 



319 



ate atg tgt get gtg aca gtg ttg get ctg gga ate att ttg gca tet 
He Met Cys Ala Val Thr Val Leu Ala Leu Gly He He Leu Ala Ser 
55 60 65 



367 



gtt cct cct gtc cca tat ttt aac tea gtg ttt tet gtc ctg tta aaa 
Val Pro Pro Val Pro Tyr Phe Asn Ser Val Phe Ser Val Leu Leu Lys 
70 75 80 



415 



tet gge tac cca ttt ata gga get ttg ttt ttt att gcc tet gga att 
Ser Gly Tyr Pro Phe He Gly Ala Leu Phe Phe He Ala Ser Gly He 
85 90 95 100 



463 



ttg tec ate att acg gag aga aag tea aca aaa cct ttg gta gat gcc 
Leu Ser lie lie Thr Glu Arg Lys Ser Thr Lys Pro Leu Val Asp Ala 



511 
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105 110 115 

age eta act ctg aat ate ctg agt gtt tea ttt get ttc gtg ggc ate 559 

Ser Leu Thr Leu Asn lie Leu Ser Val Ser Phe Ala Phe Val Gly lie 
120 125 130 

att ate ate tct gtc age ctg get ggt ttg cat cct gee tea gag caa 607 

lie lie lie Ser Val Ser Leu Ala Gly Leu His Pro Ala Ser Glu Gin 

135 140 145 

tgc aag cag age aag gag ctt agt eta att gaa cat gac tac tac caa 655 

Cys Lys Gin Ser Lys Glu Leu Ser Leu He Glu His Asp Tyr Tyr Gin 

150 155 160 

cct ttc tac aac tea gac agg agt gaa tgt gee gtc ace aag tct att 703 

Pro Phe Tyr Asn Ser Asp Arg Ser Glu Cys Ala Val Thr Lys Ser He 
165 170 175 180 

ctg act gga gec ctt tea gtg atg ctg ate ate agt gtg ttg gag ctt 751 

Leu Thr Gly Ala Leu Ser Val Met Leu He He Ser Val Leu, Glu Leu 
185 190 195 

ggt ctg get ttg etc tct gee atg ctg tgg ttg aga gag ggt gtt ctg 799 

Gly Leu Ala Leu Leu Ser Ala Met Leu Trp Leu Arg Glu Gly Val Leu 
200 205 210 



act tec ctt aga atg tga ttttttctgt cttataactc caataatgaa 847 
Thr Ser Leu Arg Met 
215 

gccaatgtgg aatcaaaggc actttgtaac tgtgcatatg aggaacgatt gt 899 



<210> 30 

<211> 217 

<212> PRT 

<213> Mus musculus 

<400> 30 

Met He Pro Gin Val Val Thr Asn Glu Thr He Thr Thr He Ser Pro 
15 10 15 



Asn Gly He Asn Phe Pro Gin Lys Asp Glu Ser Gin Pro Thr Gin Gin 
20 25 30 



Arg Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu He Lys Val He 
35 40 45 



Val Ala He Gin He Met Cys Ala Val Thr Val Leu Ala Leu Gly He 
50 55 60 
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He Leu Ala Ser Val Pro Pro Val Pro Tyr Phe Asn Ser Val Phe Ser 
65 70 75 80 



Val Leu Leu Lys Ser Gly Tyr Pro Phe He Gly Ala Leu Phe Phe He 
85 90 95 



Ala Ser Gly He Leu Ser He He Thr Glu Arg Lys Ser Thr Lys Pro 
100 105 HO 



Leu Val Asp Ala Ser Leu Thr Leu Asn He Leu Ser Val Ser Phe Ala 
115 120 125 



Phe Val Gly He He He He Ser Val Ser Leu Ala Gly Leu His Pro 
130 135 140 



Ala Ser Glu Gin Cys Lys Gin Ser Lys Glu Leu Ser Leu He Glu His 
145 150 155 160 



Asp Tyr Tyr Gin Pro Phe Tyr Asn Ser Asp Arg Ser Glu Cys Ala Val 
165 170 175 



Thr Lys Ser He Leu Thr Gly Ala Leu Ser Val Met Leu He He Ser 
180 185 190 



Val Leu Glu Leu Gly Leu Ala Leu Leu Ser Ala Met Leu Trp Leu Arg 
195 200 205 



Glu Gly Val Leu Thr Ser Leu Arg Met 
210 215 



<210> 31 

<211> 1370 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 

<222> (1)..(1370) 

<220> 
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<221> CDS 

<222> (110) . . (853) 



<400> 31 

caaggatcag actgagaaaa cccatagtta aagctcttgg ggtttctgag agctcagctg 60 

gaagtgactg ggtgacaagg cgcacaggct cagccgtgga agctccatc atg att cca 118 

Met He Pro 
1 

caa gta gtg acc agt gag act gtc aca gtg att tea cca aat gga ate 166 
Gin Val Val Thr Ser Glu Thr Val Thr Val He Ser Pro Asn Gly He 
5 10 15 

age ttt ccc caa aca gac aaa ccc cag cct tec cac cag age caa gac 214 
Ser Phe Pro Gin Thr Asp Lys Pro Gin Pro Ser His Gin Ser Gin Asp 
20 25 30 35 

age ctg aag aaa cat eta aag get gag ate aaa gtg atg gcg gca ate 262 
Ser Leu Lys Lys His Leu Lys Ala Glu He Lys Val Met Ala Ala He 
40 45 50 

cag ate atg tgt get gtg atg gtg ttg agt ctg gga ate att ttg gca 310 
Gin He Met Cys Ala Val Met Val Leu Ser Leu Gly He He Leu Ala 
55 60 65 

tct gtt ccc tec aat eta cac ttt acc tea gtg ttt tec ate ctg tta 358 
Ser Val Pro Ser Asn Leu His Phe Thr Ser Val Phe Ser He Leu Leu 
70 75 80 

gaa tct ggc tac cca ttt gta gga get ttg ttt ttt gee ate tct gga 406 
Glu Ser Gly Tyr Pro Phe Val Gly Ala Leu Phe Phe Ala lie Ser Gly 
85 90 95 

att ctg tct att gtc aca gag aaa aag atg act aag cct ttg gtt cac 454 
He Leu Ser He Val Thr Glu Lys Lys Met Thr Lys Pro Leu Val His 
100 105 110 115 

age age eta gee ctg age ate ctg agt gtc etc tct get ctt aca ggc 502 
Ser Ser Leu Ala Leu Ser He Leu Ser Val Leu Ser Ala Leu Thr Gly 
120 125 130 

ate get att etc tct gtc agt ttg get get tta gag cct gee ttg cag 550 
He Ala He Leu Ser Val Ser Leu Ala Ala Leu Glu Pro Ala Leu Gin 
135 140 145 

caa tgt aag ctg get ttc aca caa eta gac aca acc caa gac get tat 598 
Gin Cys Lys Leu Ala Phe Thr Gin Leu Asp Thr Thr Gin Asp Ala Tyr 
150 155 160 

cat ttc ttt age cct gag cca tta aac age tgc ttc gtg gee aaa get 646 
His Phe Phe Ser Pro Glu Pro Leu Asn Ser Cys Phe Val Ala Lys Ala 
165 170 175 

get ctg act gga gtc ttt tea ctg atg eta ate age agt gtg ttg gag '694 
Ala Leu Thr Gly Val Phe Ser Leu Met Leu lie Ser Ser Val Leu Glu 
180 185 190 195 
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ctt ggc ctg get gtc etc act gee aca ctg tgg tgg aaa cag age tec 742 
Leu Gly Leu Ala Val Leu Thr Ala Thr Leu Trp Trp Lys Gin Ser Ser 
200 205 210 

tct get ttc tct ggg aat gtg att ttc ctg tct cag aac tea aag aat 790 
Ser Ala Phe Ser Gly Asn Val lie Phe Leu Ser Gin Asn Ser Lys Asn 
215 220 225 

aaa tec agt gta tct tea gag tea ctt tgt aac cct aca tat gaa aac 838 
Lys Ser Ser Val Ser Ser Glu Ser Leu Cys Asn Pro Thr Tyr Glu Asn 
230 235 240 

ata ttg act tea taa gaattaagta gaggttatat agcagaaaaa tctgtcttta 893 
lie Leu Thr Ser 
245 

953 
1013 
1073 
1133 
1193 
1253 
1313 
1370 



<210> 32 

<211> 247 

<212> PRT 

<213> Mus musculus 

<400> 32 

Met He Pro Gin Val Val Thr Ser Glu Thr Val Thr Val He Ser Pro 
15 10 15 



Asn Gly He Ser Phe Pro Gin Thr Asp Lys Pro Gin Pro Ser His Gin 
20 25 30 



Ser Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu He Lys Val Met 
35 40 45 



Ala Ala He Gin He Met Cys Ala Val Met Val Leu Ser Leu Gly He 
50 55 60 



acatgattta 


gaaaagecat 


ttactgtgtg 


acaacaatgc 


ttaatatctt 


gatatcttaa 


tgtgtgtatt 


ggttaatcag 


caaccatgaa 


aaacatacta 


actggctggg 


ttcagtagca 


cgctcttgat 


ttggtgtcag 


tcaaaacaca 


gacctgtaaa 


ttccaattta 


tgtagtggtc 


aaagagcccc 


aattattttc 


tcaaaaaact 


ggaagaatgt 


ttcataggat 


catggtggag 


ccaatgggca 


acagttcttc 


ttatccttgt 


cacttggctg 


caggaggtac 


tgactagggc 


ctgagatcat 


attctgtgtg cgtggcatgg 


acttcatggc 


atctatttta 


ttcataagca 


catgaaaaca 


agtcatctct 


tatgaagtct 


caaagagcat 


aaaaaagtta 


gcctccaaat 


aaagtcttta 


tataatcccc 


ccaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaa 
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lie Leu Ala Ser Val Pro Ser Asn Leu His Phe Thr Ser Val Phe Ser 
65 70 75 80 



lie Leu Leu Glu Ser Gly Tyr Pro Phe Val Gly Ala Leu Phe Phe Ala 
85 90 95 



lie Ser Gly lie Leu Ser He Val Thr Glu Lys Lys Met Thr Lys Pro 
100 105 110 



Leu Val His Ser Ser Leu Ala Leu Ser He Leu Ser Val Leu Ser Ala 
115 120 125 



Leu Thr Gly He Ala He Leu Ser Val Ser Leu Ala Ala Leu Glu Pro 
130 135 140 



Ala Leu Gin Gin Cys Lys Leu Ala Phe Thr Gin Leu Asp Thr Thr Gin 
145 150 155 160 



Asp Ala Tyr His Phe Phe Ser Pro Glu Pro Leu Asn Ser Cys Phe Val 
165 170 175 



Ala Lys Ala Ala Leu Thr Gly Val Phe Ser Leu Met Leu He Ser Ser 
180 185 190 



Val Leu Glu Leu Gly Leu Ala Val Leu Thr Ala Thr Leu Trp Trp Lys 
195 200 205 



Gin Ser Ser Ser Ala Phe Ser Gly Asn Val He Phe Leu Ser Gin Asn 
210 215 220 



Ser Lys Asn Lys Ser Ser Val Ser Ser Glu Ser Leu Cys Asn Pro Thr 
225 230 235 240 



Tyr Glu Asn He Leu Thr Ser 
245 



<210> 33 

<211> 938 

<212> DNA 

<213> Mus musculus 
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<220> 

<221> CDNA 
<222> (1)..(938) 



<220> 

<221> CDS 

<222> (105) . . (809) 



<400> 33 

aacagctggt aagaggttga gtctgagttc cccaggagca gagtgagcag gcccaggagc 60 

tgcagcacac aggacagggc tgtgaggaca gcatccctgg cacc atg cgt eta cag 116 

Met Arg Leu Gin 
1 

ctt ggc acc aag aac att ggg tgg gac tgc ttt cca aag gac ate att 164 
Leu Gly Thr Lys Asn lie Gly Trp Asp Cys Phe Pro Lys Asp lie lie 
5 10 15 20 

ate cac aaa aga gag aaa act gga cat aca tat gaa aaa gaa gat gac 212 
lie His Lys Arg Glu Lys Thr Gly His Thr Tyr Glu Lys Glu Asp Asp 
25 30 35 

ctg ctg att gga gtg cct agt gaa gec aca ctt ctt gga acc ate cag 260 
Leu Leu lie Gly Val Pro Ser Glu Ala Thr Leu Leu Gly Thr lie Gin 
40 45 50 

eta eta tgt gee ctg ata ctt gca age ttt ggg ggc att ttg gtg tea 3 08 

Leu Leu Cys Ala Leu lie Leu Ala Ser Phe Gly Gly lie Leu Val Ser 
55 60 65 

get tec tac ttc aac cca gaa gtt tec acc act ttg ata tct gga tac 356 
Ala Ser Tyr Phe Asn Pro Glu Val Ser Thr Thr Leu lie Ser Gly Tyr 
70 75 80 

ctg ttt att gga tec etc tgt ttt gee att get gga att etc tec att 404 
Leu Phe lie Gly Ser Leu Cys Phe Ala lie Ala Gly lie Leu Ser lie 
85 90 95 100 

ate tct gaa aaa ate tea acc aaa ccc ttt gca ctg age age ctg gee 452 
lie Ser Glu Lys lie Ser Thr Lys Pro Phe Ala Leu Ser Ser Leu Ala 
105 110 115 

tec aat gta gca age tct gtt gtt gee gtc att ggc etc ttc etc ttc 500 
Ser Asn Val Ala Ser Ser Val Val Ala Val He Gly Leu Phe Leu Phe 
120 125 130 

acc tat tgt ctg ata gee ctg ggg agt get ttc cca cac tgt aac tea 548 
Thr Tyr Cys Leu lie Ala Leu Gly Ser Ala Phe Pro His Cys Asn Ser 
135 140 145 

gaa aag aaa ttc ctg tec tta ttg tct tac ttg aaa tec cac cat tgg 596 
Glu Lys Lys Phe Leu Ser Leu Leu Ser Tyr Leu Lys Ser His His Trp 
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150 155 160 



aag aat gaa gac aag aac tgt tac ctg get tat gtc ggt get atg agt 644 
Lys Asn Glu Asp Lys Asn Cys Tyr Leu Ala Tyr Val Gly Ala Met Ser 
165 170 175 180 

get ctg ggg atg atg ctt etc ttc act gtg ctg gag gtt ttc eta get 692 
Ala Leu Gly Met Met Leu Leu Phe Thr Val Leu Glu Val Phe Leu Ala 
185 190 195 

gga tac agt tct ate ttt tgg tgg aag caa gtc tac tec aac aaa cct 740 
Gly Tyr Ser Ser lie Phe Trp Trp Lys Gin Val Tyr Ser Asn Lys Pro 
200 205 210 

999 99t aca tfct ttc ttg cct caa tea caa gat cat ace caa ctt gtc 788 
Gly Gly Thr Phe Phe Leu Pro Gin Ser Gin Asp His Thr Gin Leu Val 
215 220 225 

aaa age aat ttg ttg caa taa tggatatgaa ataaaaaaga agaaagaaga 83 9 

Lys Ser Asn Leu Leu Gin 
230 

aatctgttgc tcacagcaga gtacagctag acttgectga aatctcagct ttggtagatc 899 
ctaataaaca tctgaagaaa aaaaaaaaaa aaaaaaaaa 938 



<210> 34 

<211> 234 

<212> PRT 

<213> Mus musculus 

<400> 34 

Met Arg Leu Gin Leu Gly Thr Lys Asn lie Gly Trp Asp Cys Phe Pro 
1 5 10 15 



Lys Asp lie He He His Lys Arg Glu Lys Thr Gly His Thr Tyr Glu 
20 25 30 



Lys Glu Asp Asp Leu Leu He Gly Val Pro Ser Glu Ala Thr Leu Leu 
35 40 45 



Gly Thr He Gin Leu Leu Cys Ala Leu lie Leu Ala Ser Phe Gly Gly 
50 55 60 



He Leu Val Ser Ala Ser Tyr Phe Asn Pro Glu Val Ser Thr Thr Leu 
65 70 75 80 



He Ser Gly Tyr Leu Phe He Gly Ser Leu Cys Phe Ala He Ala Gly 
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85 90 95 



He Leu Ser He He Ser Glu Lys He Ser Thr Lys Pro Phe Ala Leu 
100 105 110 



Ser Ser Leu Ala Ser Asn Val Ala Ser Ser Val Val Ala Val He Gly 
115 120 125 



Leu Phe Leu Phe Thr Tyr Cys Leu lie Ala Leu Gly Ser Ala Phe Pro 
130 135 140 



His Cys Asn Ser Glu Lys Lys Phe Leu Ser Leu Leu Ser Tyr Leu Lys 
145 150 155 160 



Ser His His Trp Lys Asn Glu Asp Lys Asn Cys Tyr Leu Ala Tyr Val 
165 170 175 



Gly Ala Met Ser Ala Leu Gly Met Met Leu Leu Phe Thr Val Leu Glu 
180 185 190 



Val Phe Leu Ala Gly Tyr Ser Ser He Phe Trp Trp Lys Gin Val Tyr 
195 200 205 



Ser Asn Lys Pro Gly Gly Thr Phe Phe Leu Pro Gin Ser Gin Asp His 
210 215 220 



Thr Gin Leu Val Lys Ser Asn Leu Leu Gin 
225 230 



<210> 35 

<211> 1025 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDNA 

<222> (1)..(1025) 

<220> 
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<221> CDS 

<222> (12).. (818) 



<400> 35 

aattcgtcga c atg gaa ccg gag caa gag aga ctt act tgg caa ccg ggc 50 
Met Glu Pro Glu Gin Glu Arg Leu Thr Trp Gin Pro Gly 
15 10 

aca gta age atg aat aca gta act tea ccg ggc ccc atg gec aat tct 98 

Thr Val Ser Met Asn Thr Val Thr Ser Pro Gly Pro Met Ala Asn Ser 
15 20 25 

gta tat gtg gtg gca ccg ccc aac agt tat cct gtg gtc cca gga acc 146 

Val Tyr Val Val Ala Pro Pro Asn Ser Tyr Pro Val Val Pro Gly Thr 

30 35 40 45 

gtg cct cag atg cct att tat ccc age aat cag cct caa gtc cat gtg 194 

Val Pro Gin Met Pro He Tyr Pro Ser Asn Gin Pro Gin Val His Val 

50 55 60 

att tct ggt cat ctg cct ggt ttg gtg cca get atg acc gaa cca cct 242 

He Ser Gly His Leu Pro Gly Leu Val Pro Ala Met Thr Glu Pro Pro 

65 70 75 

gee cag aga gtc ttg aaa aag gga caa gtc eta ggg gec ate cag ate 2 90 

Ala Gin Arg Val Leu Lys Lys Gly Gin Val Leu Gly Ala He Gin He 
80 85 90 

ctg ate ggc ctg gtg cac ate ggc ctg ggc tec ate atg ate acc aac 338 

Leu He Gly Leu Val His He Gly Leu Gly Ser He Met He Thr Asn 
95 100 105 

etc ttc age cac tat aca cct gtc tec etc tat gga ggt ttc ccc ttc 386 

Leu Phe Ser His Tyr Thr Pro Val Ser Leu Tyr Gly Gly Phe Pro Phe 

110 115 120 125 

tgg gga ggc ate tgg ttt ate att tea gga tct etc tea gtg gcg gca 434 

Trp Gly Gly He Trp Phe He He Ser Gly Ser Leu Ser Val Ala Ala 

130 135 140 

gaa act cag ccc aat tea cct tgc ctg ctg aat ggc agt gtg ggc ttg 482 

Glu Thr Gin Pro Asn Ser Pro Cys Leu Leu Asn Gly Ser Val Gly Leu 

145 150 155 

aac ate ttc age gec ate tgt tct gcg gtg ggc ate atg etc ttc ate 530 

Asn 'lie Phe Ser Ala He Cys Ser Ala Val Gly He Met Leu Phe He 
160 165 170 

aca gac ate agt ate tea tea gga tac ate tac cct age tac tac cct 578 

Thr Asp He .Ser He Ser Ser Gly Tyr He Tyr Pro Ser Tyr Tyr Pro 
175 180 185 

tac cag gag aac ttg ggt gtg aga aca ggc gtg get att tct agt gtg 626 

Tyr Gin Glu Asn Leu Gly Val Arg Thr Gly Val Ala He Ser Ser Val 

190 195 200 205 

eta etc ate ttc tgc etc ttg gag etc age att gca agt gtg tec tec 674 

Leu Leu He Phe Cys Leu Leu Glu Leu Ser He Ala Ser Val Ser Ser 
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210 215 220 

cac ttt ggc tgc caa gtg gcc tgt tgt cac tac aac aat cca ggt gtg 722 
His Phe Gly Cys Gin Val Ala Cys Cys His Tyr Asn Asn Pro Gly Val 
225 230 235 

gtc att cca aat gtc tat gca gca aac cca gtg gtc ate cca gaa cca 770 
Val lie Pro Asn Val Tyr Ala Ala Asn Pro Val Val He Pro Glu Pro 
240 245 250 

cca aac cca ata cca agt tat tec gaa gta gtt caa gac tec aga taa 818 
Pro Asn Pro He Pro Ser Tyr Ser Glu Val Val Gin Asp Ser Arg 
255 260 265 

gtgaacctga agattctaga gaccaagtga catcctctcc tacctagact cctataaacc 878 

aagttcttcc tttcctgacg aagggtaaat atctttcttg tggcctaaat tatagactct 938 

tgcttcaact caccctggaa aaatctctat taaaacgaga tgggagattg aaatggattc 998 

aaataaagat gctctagccg gaaaaaa 1025 

<210> 36 

<211> 268 

<212> PRT 

<213> Mus musculus 



<400> 36 

Met Glu Pro Glu Gin Glu Arg Leu Thr Trp Gin Pro Gly Thr Val Ser 
1 5 10 15 



Met Asn Thr Val Thr Ser Pro Gly Pro Met Ala Asn Ser Val Tyr Val 
20 25 30 



Val Ala Pro Pro Asn Ser Tyr Pro Val Val Pro Gly Thr Val Pro Gin 
35 40 45 



Met Pro He Tyr Pro Ser Asn Gin Pro Gin Val His Val He Ser Gly 
50 55 60 



His Leu Pro Gly Leu Val Pro Ala Met Thr Glu Pro Pro Ala Gin Arg 
65 70 75 80 



Val Leu Lys Lys Gly Gin Val Leu Gly Ala He Gin He Leu He Gly 
85 90 95 



Leu Val His He Gly Leu Gly Ser He Met He Thr Asn Leu Phe Ser 
100 105 110 
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His Tyr Thr Pro Val Ser Leu Tyr Gly Gly Phe Pro Phe Trp Gly Gly 
115 120 125 



He Trp Phe He He Ser Gly Ser Leu Ser Val Ala Ala Glu Thr Gin 
130 135 140 



Pro Asn Ser Pro Cys Leu Leu Asn Gly Ser Val Gly Leu Asn He Phe 
145 150 155 160 



Ser Ala He Cys Ser Ala Val Gly He Met Leu Phe He Thr Asp He 
165 170 175 



Ser He Ser Ser Gly Tyr He Tyr Pro Ser Tyr Tyr Pro Tyr Gin Glu 
180 185 190 



Asn Leu Gly Val Arg Thr Gly Val Ala He Ser Ser Val Leu Leu He 
195 200 205 



Phe Cys Leu Leu Glu Leu Ser He Ala Ser Val Ser Ser His Phe Gly 
210 215 220 



Cys Gin Val Ala Cys Cys His Tyr Asn Asn Pro Gly Val Val He Pro 
225 230 235 240 



Asn Val Tyr Ala Ala Asn Pro Val Val He Pro Glu Pro Pro Asn Pro 
245 250 255 



He Pro Ser Tyr Ser Glu Val Val Gin Asp Ser Arg 
260 265 



<210> 37 

<211> 1038 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 

<222> (1)..(1038) 
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<220> 

<221> CDS 

<222> (75) . . (878) 



<400> 37 

gctctggatg gtcaggcaca gctgaagcct acaacagagt tcccggggcc agccagggtt 60 

cccggccagg gtca atg get ggc caa get ccc aca gcg gtt ccc ggc agt 110 
Met Ala Gly Gin Ala Pro Thr Ala Val Pro Gly Ser 
15 10 

gtt act gga gaa gtc tea cga tgg cag aac eta gga cct gee cag cct 158 
Val Thr Gly Glu Val Ser Arg Trp Gin Asn Leu Gly Pro Ala Gin Pro 
15 20 25 

gca cag aaa gta gec cag ccc caa aac ctg gtt cca gat ggg cac ctt 206 
Ala Gin Lys Val Ala Gin Pro Gin Asn Leu Val Pro Asp Gly His Leu 
30 35 40 

gag aaa gec ctg gag gga agt gac ctt eta cag aag ttg ggt ggc ttt 254 
Glu Lys Ala Leu Glu Gly Ser Asp Leu Leu Gin Lys Leu Gly Gly Phe 
45 50 55 60 

cac ate gec att get ttt gca cac ttg gee ttt ggt ggc tac ttg ate 302 
His lie Ala He Ala Phe Ala His Leu Ala Phe Gly Gly Tyr Leu He 
65 70 75 

tec aca gtc aag aat ctg cac ctg gtg gtg ttg aag tgc tgg tat cca 350 
Ser Thr Val Lys Asn Leu His Leu Val Val Leu Lys Cys Trp Tyr Pro 
80 85 90 

etc tgg gga act gtc tct ttt etc gtt gca ggg atg gcg gec atg aca 398 
Leu Trp Gly Thr Val Ser Phe Leu Val Ala Gly Met Ala Ala Met Thr 
95 100 105 



aca gtg aca ttc cct aag ace tct ctg aag gtt etc tgt gtg ata gec 446 
Thr Val Thr Phe Pro Lys Thr Ser Leu Lys Val Leu Cys Val He Ala 
110 115 120 

aac gtt ate age ttg ttc tgc gca ctg gee ggc ttc ttt gtc att gee 494 
Asn Val He Ser Leu Phe Cys Ala Leu Ala Gly Phe Phe Val He Ala 
125 130 135 140 

aag gac etc ttc ctg gag ggt cct ttt cca tgg cca ate tgg aga cca 542 
Lys Asp Leu Phe Leu Glu Gly Pro Phe Pro Trp Pro He Trp Arg Pro 
145 150 155 

tac cct gaa ccc aca ace tac ate caa agg eta gag ctg ace ctg ttc 590 
Tyr Pro Glu Pro Thr Thr Tyr He Gin Arg Leu Glu Leu Thr Leu Phe 
160 165 170 

tgc ttc acc ttc ctg gag ate ttc ctg teg ggg tec aca gee ate aca 638 
Cys Phe Thr Phe Leu Glu He Phe Leu Ser Gly Ser Thr Ala He Thr 
175 180 185 

gec tat agg atg aaa cgc ctg caa gca gag gac aag gat gac acc cct 686 
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Ala Tyr Arg Met Lys Arg Leu Gin Ala Glu Asp Lys Asp Asp Thr Pro 
190 195 200 

ttc gtt cct gac aca cct atg gaa ctt aaa ggt eta teg ctg ggg cca 734 
Phe Val Pro Asp Thr Pro Met Glu Leu Lys Gly Leu Ser Leu Gly Pro 
205 210 215 220 

cca cca tec tat aaa gat gtg gee caa gga cac tec tec tct gat act 782 
Pro Pro Ser Tyr Lys Asp Val Ala Gin Gly His Ser Ser Ser Asp Thr 
225 230 235 

ggc aga gec ttg gec acc age tct gga ctg ctg ctg gee tct gac tec 830 
Gly Arg Ala Leu Ala Thr Ser Ser Gly Leu Leu Leu Ala Ser Asp Ser 
240 245 250 

ttc cat caa get ctg ctt cac acg ggc cca agg acc ttg aga aag tag 878 
Phe His Gin Ala Leu Leu His Thr Gly Pro Arg Thr Leu Arg Lys 
255 260 265 

tctaagacta ctagactttc ccaataaaga ctgagttcct geaatgecaa gtccttccct 938 

gcctcccatt ggctggactc eggtgatgae acaggcgtga ettgeacaaa taaatgeata 998 

accactatga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1038 

<210> 38 

<211> 267 

<212> PRT 

<213> Mus musculus 



<400> 38 

Met Ala Gly Gin Ala Pro Thr Ala Val Pro Gly Ser Val Thr Gly Glu 
1 5 10 15 



Val Ser Arg Trp Gin Asn Leu Gly Pro Ala Gin Pro Ala Gin Lys Val 
20 25 30 



Ala Gin Pro Gin Asn Leu Val Pro Asp Gly His Leu Glu Lys Ala Leu 
35 40 45 



Glu Gly Ser Asp Leu Leu Gin Lys Leu Gly Gly Phe His He Ala He 
50 55 60 



Ala Phe Ala His Leu Ala Phe Gly Gly Tyr Leu He Ser Thr Val Lys 
65 70 75 80 



Asn Leu His Leu Val Val Leu Lys Cys Trp Tyr Pro Leu Trp Gly Thr 
85 90 95 
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Val Ser Phe Leu Val Ala Gly Met Ala Ala Met Thr Thr Val Thr Phe 
100 105 110 



Pro Lys Thr Ser Leu Lys Val Leu Cys Val lie Ala Asn Val lie Ser 
115 120 125 



Leu Phe Cys Ala Leu Ala Gly Phe Phe Val lie Ala Lys Asp Leu Phe 
130 135 140 



Leu Glu Gly Pro Phe Pro Trp Pro lie Trp Arg Pro Tyr Pro Glu Pro 
145 150 155 160 



Thr Thr Tyr lie Gin Arg Leu Glu Leu Thr Leu Phe Cys Phe Thr Phe 
165 170 175 



Leu Glu lie Phe Leu Ser Gly Ser Thr Ala lie Thr Ala Tyr Arg Met 
180 185 190 



Lys Arg Leu Gin Ala Glu Asp Lys Asp Asp Thr Pro Phe Val Pro Asp 
195 200 205 



Thr Pro Met Glu Leu Lys Gly Leu Ser Leu Gly Pro Pro Pro Ser Tyr 
210 215 220 



Lys Asp Val Ala Gin Gly His Ser Ser Ser Asp Thr Gly Arg Ala Leu 
225 230 235 240 



Ala Thr Ser Ser Gly Leu Leu Leu Ala Ser Asp Ser Phe His Gin Ala 
245 250 255 



Leu Leu His Thr Gly Pro Arg Thr Leu Arg Lys 
260 265 



<210> 39 

<211> 1146 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> cDNA 

<222> (1) . . (1146) 
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<220> 

<221> CDS 

<222> (134) . . (1027) 
<300> 

<301> Tedder, TF, Klejman, G, Schlossman, SF, Saito, H 

<302> Structure of the gene encoding the human B lymphocyte 
differentiation antigen CD20 (Bl) 

<303> J Immunol 

<304> 142 

<305> 7 

<306> 2560-2568 

<307> 1989- - 

<309> - - 

<300> 

<301> Tedder, TF, Streuli, M, Schlossman, SF, and Saito H 

<302> Isolation and structure of a cDNA encoding the Bl (CD20) cell- 
surface antigen of human B lymphocytes 

<303> Proc Natl Acad Sci USA 

<304> 85 

<305> 1 

<30.6> 208-212 

<307> 1988- - 

<309> - - 



<400> 39 

cctcaatgac actcatggag gaaatgctga gagaagcatt cagatgcatg acacaaggta 60 

agactgccaa aaatcttgtt cttgctctcc tcattttgtt atttgtttta tttttaggag 120 

ttttgagagc aaa atg aca aca ccc aga aat tea gta aat ggg act ttc 169 
Met Thr Thr Pro Arg Asn Ser Val Asn Gly Thr Phe 
15 10 

ccg gca gag cca atg aaa ggc cct att get atg caa tct ggt cca aaa 217 
Pro Ala Glu Pro Met Lys Gly Pro lie Ala Met Gin Ser Gly Pro Lys 
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15 20 25 

cca etc ttc agg agg atg tct tea ctg gtg ggc ccc acg caa age ttc 265 

Pro Leu Phe Arg Arg Met Ser Ser Leu Val Gly Pro Thr Gin Ser Phe 
30 ~ 35 40 

ttc atg agg gaa tct aag act ttg ggg get gtc cag att atg aat ggg 313 

Phe Met Arg Glu Ser Lys Thr Leu Gly Ala Val Gin He Met Asn Gly 

45 50 55 60 

etc ttc cac att gee ctg ggg ggt ctt ctg atg ate cca gca ggg ate 361 

Leu Phe His He Ala Leu Gly Gly Leu Leu Met He Pro Ala Gly lie 
65 70 75 



tat gca ccc ate tgt gtg act gtg tgg tac cct etc tgg gga ggc att 4 09 

Tyr Ala Pro He Cys Val Thr Val Trp Tyr Pro Leu Trp Gly Gly He 
80 85 90 

atg tat att att tec gga tea etc ttg gca gca acg gag aaa aac tct 457 
Met Tyr He He Ser Gly Ser Leu Leu Ala Ala Thr Glu Lys Asn Ser 
95 100 105 

agg aag tgt ttg gtc aaa gga aaa atg ata atg aat tea ttg age etc 505 
Arg Lys Cys Leu Val Lys Gly Lys Met He Met Asn Ser Leu Ser Leu 
110 115 120 

ttt get gee att tct gga atg att ctt tea ate atg gac ata ctt aat 553 
Phe Ala Ala He Ser Gly Met He Leu Ser He Met Asp He Leu Asn 
125 130 135 140 

att aaa att tec cat ttt tta aaa atg gag agt ctg aat ttt att aga 601 
He Lys He Ser His Phe Leu Lys Met Glu Ser Leu Asn Phe He Arg 
145 150 155 

get cac aca cca tat att aac ata tac aac tgt gaa cca get aat ccc 649 
Ala His Thr Pro Tyr He Asn He Tyr Asn Cys Glu Pro Ala Asn Pro 
160 165 170 

tct gag aaa aac tec cca tct ace caa tac tgt tac age ata caa tct 697 
Ser Glu Lys Asn Ser Pro Ser Thr Gin Tyr Cys Tyr Ser He Gin Ser 
175 180 185 

ctg ttc ttg ggc att ttg tea gtg atg ctg ate ttt gee ttc ttc cag 745 
Leu Phe Leu Gly He Leu Ser Val Met Leu He Phe Ala Phe Phe Gin 
190 195 200 

gaa ctt gta ata get ggc ate gtt gag aat gaa tgg aaa aga acg tgc 793 
Glu Leu Val He Ala Gly He Val Glu Asn Glu Trp Lys Arg Thr Cys 
205 210 215 220 

tec aga ccc aaa tct aac ata gtt etc ctg tea gca gaa gaa aaa aaa 841 
Ser Arg Pro Lys Ser Asn He Val Leu Leu Ser Ala Glu Glu Lys Lys 
225 230 235 

gaa cag act att gaa ata aaa gaa gaa gtg gtt ggg eta act gaa aca 889 
Glu Gin Thr He Glu He Lys Glu Glu Val Val Gly Leu Thr Glu Thr 
240 245 250 

tct tec caa cca aag aat gaa gaa gac att gaa att att cca ate caa 937 
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Ser Ser Gin Pro Lys Asn Glu Glu Asp lie Glu lie lie Pro lie Gin 
255 260 265 

gaa gag gaa gaa gaa gaa aca gag acg aac ttt cca gaa cct ccc caa 985 
Glu Glu Glu Glu Glu Glu Thr Glu Thr Asn Phe Pro Glu Pro Pro Gin 
270 275 280 

gat cag gaa tec tea cca ata gaa aat gac age tct cct taa 1027 
Asp Gin Glu Ser Ser Pro lie Glu Asn Asp Ser Ser Pro 
285 290 295 

gtgatttctt ctgttttctg tttccttttt taaacattag tgttcatagc ttccaagaga 1087 
catgetgact ttcatttctt gaggtactct gcacatacgc accacatctc tatctggee 1146 

<210> 40 

<211> 297 

<212> PRT 

<213> Homo sapiens 



<400> 40 

Met Thr Thr Pro Arg Asn Ser Val Asn Gly Thr Phe Pro Ala Glu Pro 
15 10 15 



Met Lys Gly Pro lie Ala Met Gin Ser Gly Pro Lys Pro Leu Phe Arg 
20 25 30 



Arg Met Ser Ser Leu Val Gly Pro Thr Gin Ser Phe Phe Met Arg Glu 
35 40 45 



Ser Lys Thr Leu Gly Ala Val Gin He Met Asn Gly Leu Phe His He 
50 55 60 



Ala Leu Gly Gly Leu Leu Met He Pro Ala Gly He Tyr Ala Pro He 
65 70 75 80 



Cys Val Thr Val Trp Tyr Pro Leu Trp Gly Gly He Met Tyr He He 
85 90 95 



Ser Gly Ser Leu Leu Ala Ala Thr Glu Lys Asn Ser Arg Lys Cys Leu 
100 105 110 



Val Lys Gly Lys Met He Met Asn Ser Leu Ser Leu Phe Ala Ala He 
115 120 125 



Ser Gly Met He Leu Ser He Met Asp He Leu Asn He Lys He Ser 
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130 135 140 



His Phe Leu Lys Met Glu Ser Leu Asn Phe lie Arg Ala His Thr Pro 
145 150 155 160 



Tyr lie Asn lie Tyr Asn Cys Glu Pro Ala Asn Pro Ser Glu Lys Asn 
165 170 175 



Ser Pro Ser Thr Gin Tyr Cys Tyr Ser lie Gin Ser Leu Phe Leu Gly 
180 185 190 



lie Leu Ser Val Met Leu He Phe Ala Phe Phe Gin Glu Leu Val He 
195 200 205 



Ala Gly He Val Glu Asn Glu Trp Lys Arg Thr Cys Ser Arg Pro Lys 
210 215 220 



Ser Asn He Val Leu Leu Ser Ala Glu Glu Lys Lys Glu Gin Thr lie 
225 230 235 240 



Glu He Lys Glu Glu Val Val Gly Leu Thr Glu Thr Ser Ser Gin Pro 
245 250 255 



Lys Asn Glu Glu Asp He Glu He He Pro He Gin Glu Glu Glu Glu 
260 265 270 



Glu Glu Thr Glu Thr Asn Phe Pro Glu Pro Pro Gin Asp Gin Glu Ser 
275 280 285 



Ser Pro He Glu Asn Asp Ser Ser Pro 
290 295 



<210> 41 

<211> 894 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> cDNA 

<222> (1)..(894) 



-58- 



WO 02/062946 



PCTAJS01/48437 



<220> 

<221> CDS 

<222> (51) . . (785) 

<300> 

<301> Kuster, H, Zhang, L, Brini, AT, MacGlashan, DWJ, Kinet, J-P 

<302> The gene and cDNA for the human high affinity immunoglobulin E 
receptor beta chain and expression of the complete human receptor 

<303> J Biol Cham 

<304> 267 

<305> 18 

<306> 12782-12787 

<307> 1992- - 

<309> - - 



<400> 41 

tctcaatata ataatattct ttattcctgg acagctcggt taatgaaaaa atg gac 56 

Met Asp 
1 



aca gaa agt aat agg aga gca aat ctt get etc cca cag gag cct tec 104 

Thr Glu Ser Asn Arg Arg Ala Asn Leu Ala Leu Pro Gin Glu Pro Ser 
5 10 15 

a 9 fc 9^9 cct 9 ca tfct 9 aa gtc ttg gaa ata tct ccc cag gaa gta tct 152 

Ser Val Pro Ala Phe Glu Val Leu Glu lie Ser Pro Gin Glu Val Ser 
20 25 30 



tea ggc aga eta ttg aag teg gee tea tec cca cca ctg cat aca tgg 200 

Ser Gly Arg Leu Leu Lys Ser Ala Ser Ser Pro Pro Leu His Thr Trp 

35 40 45 50 

ctg aca gtt ttg aaa aaa gag cag gag ttc ctg ggg gta aca caa att 24 8 

Leu Thr val Leu Lys Lys Glu Gin Glu Phe Leu Gly Val Thr Gin lie 

55 60 65 

ctg act get atg ata tgc ctt tgt ttt gga aca gtt gtc tgc tct gta 296 

Leu Thr Ala Met lie Cys Leu Cys Phe Gly Thr Val Val Cys Ser Val 

70 75 80 

ctt gat att tea cac att gag gga gac att ttt tea tea ttt aaa gca 344 

Leu Asp lie Ser His He Glu Gly Asp He Phe Ser Ser Phe Lys Ala 

85 90 95 

ggt tat cca ttc tgg gga gee ata ttt ttt tct att tct gga atg ttg 3 92 
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Gly Tyr Pro Phe Trp Gly Ala lie Phe Phe Ser lie Ser Gly Met Leu 

100 105 110 

tea att ata tct gaa agg aga aat gca aca tat ctg gtg aga gga age 440 

5 Ser lie lie Ser Glu Arg Arg Asn Ala Thr Tyr Leu Val Arg Gly Ser 
115 120 125 130 

ctg gga gca aac act gec age age ata get ggg gga acg gga att ace 488 

Leu Gly Ala Asn Thr Ala Ser Ser lie Ala Gly Gly Thr Gly lie Thr 
10 135 140 145 

ate ctg ate ate aac ctg aag aag age ttg gee tat ate cac ate cac 536 

lie Leu lie He Asn Leu Lys Lys Ser Leu Ala Tyr He His He His 
150 155 160 

15 

agt tgc cag aaa ttt ttt gag ace aag tgc ttt atg get tec ttt tec 584 

Ser Cys Gin Lys Phe Phe Glu Thr Lys Cys Phe Met Ala Ser Phe Ser 

165 170 175 

20 act gaa att gta gtg atg atg ctg ttt etc acc att ctg gga ctt ggt 632 

Thr Glu He Val Val Met Met Leu Phe Leu Thr He Leu Gly Leu Gly 

180 185 190 

agt get gtg tea etc aca ate tgt gga get ggg gaa gaa etc aaa gga 680 

25 Ser Ala Val Ser Leu Thr He Cys Gly Ala Gly Glu Glu Leu Lys Gly 
195 200 205 210 

aac aag gtt cca gag gat cgt gtt tat gaa gaa tta aac ata tat tea 728 

Asn Lys Val Pro Glu Asp Arg Val Tyr Glu Glu Leu Asn He Tyr Ser 
30 215 220 225 

get act tac agt gag ttg gaa gac cca ggg gaa atg tct cct ccc att 776 

Ala Thr Tyr Ser Glu Leu Glu Asp Pro Gly Glu Met Ser Pro Pro He 
230 235 240 

35 

gat tta taa gaatcacgtg tccagaacac tctgattcac agecaaggat 825 
Asp Leu 



40 ccagaaggcc aaggtcttgt taaggggcta ctggaaaaat ttctattctc tccacagcct 885 
gctggtttt 894 



45 <210> 42 

<211> 244 

<212> PRT 

50 

<213> Homo sapiens 



55 <400> . 42 

Met Asp Thr Glu Ser Asn Arg Arg Ala Asn Leu Ala Leu Pro Gin Glu 
1 5 10 15 

60 

Pro Ser Ser Val Pro Ala Phe Glu Val Leu Glu He Ser Pro Gin Glu 
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20 



25 



30 



Val Ser Ser Gly Arg Leu Leu Lys Ser Ala Ser Ser Pro Pro Leu His 
35 40 45 



Thr Trp Leu Thr Val Leu Lys Lys Glu Gin Glu Phe Leu Gly Val Thr 
50 55 60 



Gin He Leu Thr Ala Met He Cys Leu Cys Phe Gly Thr Val Val Cys 
65 70 75 80 



Ser Val Leu Asp He Ser His He Glu Gly Asp He Phe Ser Ser Phe 
85 90 95 



Lys Ala Gly Tyr Pro Phe Trp Gly Ala He Phe Phe Ser He Ser Gly 
100 105 110 



Met Leu Ser He He Ser Glu Arg Arg Asn Ala Thr Tyr Leu Val Arg 
115 120 125 



Gly Ser Leu Gly Ala Asn Thr Ala Ser Ser He Ala Gly Gly Thr Gly 
130 135 140 



He Thr He Leu He He Asn Leu Lys Lys Ser Leu Ala Tyr He His 
145 150 155 160 



He His Ser Cys Gin Lys Phe Phe Glu Thr Lys Cys Phe Met Ala Ser 
165 170 175 



Phe Ser Thr Glu He Val Val Met Met Leu Phe Leu Thr He Leu Gly 
180 185 190 



Leu Gly Ser Ala Val Ser Leu Thr He Cys Gly Ala Gly Glu Glu Leu 
195 200 205 



Lys Gly Asn Lys Val Pro Glu Asp Arg Val Tyr Glu Glu Leu Asn He 
210 215 220 



Tyr Ser Ala Thr Tyr Ser Glu Leu Glu Asp Pro Gly Glu Met Ser Pro 
225 230 235 240 



Pro He Asp Leu 



<210> 43 
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<211> 1646 

<212> DNA 

<213> homo sapiens 

<220> 

<221> CDNA 

<222> (1) . . (1646) 

<220> 

<221> CDS 

<222> (99) . . (743) 

<300> 

<301> Adra, CN, Lelias, JM, Kobayashi, H, Kaghad, M, Morrison, P, 
Rowley, JD, Lim, B 

<3 02> Cloning of the cDNA for a hematopoietic cell -specific protein 
related to CD20 and the beta subunit of the high-affinity IgE receptor: 
evidence for a family of proteins with four membrane -spanning regions 

<303> Proc Natl Acad Sci USA 

<304> 91 

<305> 21 

<306> 10178-10182 

<307> 1994- - 

<309> - - 



<400> 43 

tagtgatctt ttctgagtgt ctcctacttg cgacaaggtg gacttgggag gaaagccgtc 60 

tgccaaagcc tgaagcctcc aagccataaa caacccca atg gcc tec cac gaa gtt 116 

Met Ala Ser His Glu Val 
1 5 

gat aat gca gag ctg ggg tea gcc tct gcc cat ggt ace cca ggc agt 164 
Asp Asn Ala Glu Leu Gly Ser Ala Ser Ala His Gly Thr Pro Gly Ser 
10 15 20 

gag acg gga cca gaa gag ctg aat act tct gtc tac cac ccc ata aat 212 
Glu Thr Gly Pro Glu Glu Leu Asn Thr Ser Val Tyr His Pro lie Asn 
25 30 35 
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gga tea cca gat tat cag aaa gca aaa tta caa gtt ctt ggg gec ate 260 
Gly Ser Pro Asp Tyr Gin Lys Ala Lys Leu Gin Val Leu Gly Ala lie 
40 45 50 

cag ate ctg aat gca gca atg att ctg get ttg ggt gtc ttt ctg ggt 308 
Gin lie Leu Asn Ala Ala Met lie Leu Ala Leu Gly Val Phe Leu Gly 
55 60 65 70 

tec ttg caa tac cca tac cac ttc caa aag cac ttc ttt ttc ttc acc 356 
Ser Leu Gin Tyr Pro Tyr His Phe Gin Lys His Phe Phe Phe Phe Thr 
75 80 85 

ttc tac aca ggc tac ccg att tgg ggt get gtg ttt ttc tgt agt tea 404 
Phe Tyr Thr Gly Tyr Pro lie Trp Gly Ala Val Phe Phe Cys Ser Ser 
90 95 100 

gga acc ttg tct gtt gta gca ggg ata aaa ccc aca aga aca tgg ata 452 
Gly Thr Leu Ser Val Val Ala Gly lie Lys Pro Thr Arg Thr Trp lie 
105 110 115 

cag aac agt ttt gga atg aac att gee agt get aca att gca eta gtg 500 
Gin Asn Ser Phe Gly Met Asn lie Ala Ser Ala Thr lie Ala Leu Val 
120 125 130 

ggg act get ttt etc tea eta aat ata gca gtt aat ate cag tea tta 548 
Gly Thr Ala Phe Leu Ser Leu Asn lie Ala Val Asn lie Gin Ser Leu 
135 140 145 150 

agg agt tgt cac tct tea tea gag tea ccg gac eta tgc aat tac atg 596 
Arg Ser Cys His Ser Ser Ser Glu Ser Pro Asp Leu Cys Asn Tyr Met 
155 160 165 

ggc tec ata tea aat ggc atg gtg tct eta ctg ctg att etc acc ttg 644 
Gly Ser lie Ser Asn Gly Met Val Ser Leu Leu Leu lie Leu Thr Leu 
170 175 180 

ctg gaa tta tgc gta act ate tct acc ata gee atg tgg tgc aat gca 692 
Leu Glu Leu Cys Val Thr lie Ser Thr lie Ala Met Trp Cys Asn Ala 
185 190 195 

aac tgc tgt aat tea aga gag gaa att tec tea cct ccc aat tct gtg 740 
Asn Cys Cys Asn Ser Arg Glu Glu lie Ser Ser Pro Pro Asn Ser Val 
200 205 210 

taa tcaagaatac ctccttatga aaataattct gagagcatga atatttgacc 793 

ttaaatctcc agtgactcag agcttcaccc acaaactcag gagaacataa gcctgctcgt 853 

aaagctcaat ccttctatca tggcaccaat cacaagaacc ttggacgttt gactgactct 913 

atcctttctc tcctaactat aaatcctatt tgtgtgtcgt gggtatggaa ggacagatat 973 

atttctttag gcattcttgg atatctgtaa cttctatgat cattactcca aagttgtttc 1033 

cagaaattgg ttctatttct tcttatccac ctactccatt gctttatgag gtttaaggaa 1093 

ggaaggcggt ataatcccta ttcaatatat tttttctaaa atccaacttc tgaccgccca 1153 

gtaggaagaa aaatgagaca ttttttccat tacagagaaa tgcttcttga ctttaacatc 1213 
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agcattataa 


aaagtgtcaa 


ataaaaaatt accatcatta 


tcattaaaat aaattttcac 


1273 


tgtatttgag 


atgggagggt 


taaggctcag ggattttatt 


tcagtgaact gctggaactc 


1333 


acacatgccc 


tgatatgtaa 


atgatgattt atgttggcga 


gtctgagagc aagcccaaat 


1393 


gtgttcttca 


aaggacaatg 


ggaaactgta aagtagagaa 


ctaaagaata aggcctttag 


1453 


aatctgacac 


atctgggctc 


aaattctgaa actgtcactt 


attacctgta tgaacatggg 


1513 


caaattatct 


aatctctctg 


atctattttt cctcatctgt 


aaaataggtg taataataac 


1573 


aactactttg 


tcggttgctc 


tgagggttaa atgaaaataa 


aaagaaaatg tgaaacagca 


1633 


ccacaggtac 


ttg 






1646 



<210> 44 

<211> 214 

<212> PRT 

<213> homo sapiens 

<400> 44 

Met Ala Ser His Glu Val Asp Asn Ala Glu Leu Gly Ser Ala Ser Ala 
15 10 15 



His Gly Thr Pro Gly Ser Glu Thr Gly Pro Glu Glu Leu Asn Thr Ser 
20 25 30 



Val Tyr His Pro He Asn Gly Ser Pro Asp Tyr Gin Lys Ala Lys Leu 
35 40 45 



Gin Val Leu Gly Ala He Gin He Leu Asn Ala Ala Met He Leu Ala 
50 55 60 



Leu Gly Val Phe Leu Gly Ser Leu Gin Tyr Pro Tyr His Phe Gin Lys 
65 70 75 80 



His Phe Phe Phe Phe Thr Phe Tyr Thr Gly Tyr Pro He Trp Gly Ala 
85 90 95 



Val Phe Phe Cys Ser Ser Gly Thr Leu Ser Val Val Ala Gly He Lys 
100 105 110 



Pro Thr Arg Thr Trp He Gin Asn Ser Phe Gly Met Asn He Ala Ser 
115 120 125 
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Ala Thr He Ala Leu Val Gly Thr Ala Phe Leu Ser Leu Asn He Ala 
130 135 140 



Val Asn He Gin Ser Leu Arg Ser Cys His Ser Ser Ser Glu Ser Pro 
145 150 155 160 



Asp Leu Cys Asn Tyr Met Gly Ser He Ser Asn Gly Met Val Ser Leu 
165 170 175 



Leu Leu He Leu Thr Leu Leu Glu Leu Cys Val Thr He Ser Thr He 
180 185 190 



Ala Met Trp Cys Asn Ala Asn Cys Cys Asn Ser Arg Glu Glu He Ser 
195 200 205 



Ser Pro Pro Asn Ser Val 
210 



<210> 45 

<211> 1182 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDNA 

<222> (1)..{1182) 

<220> 

<221> CDS 

<222> (58) . . (861) 



<400> 45 

acacaggttg gagcagagaa agaggaaaca tagaggtgcc aaaggaacaa agacata 57 

atg atg tea tec aag cca aca age cat get gaa gta aat gaa acc ata 105 

Met Met Ser Ser Lys Pro Thr Ser His Ala Glu Val Asn Glu Thr He 

15 10 15 

ccc aac cct tac cca cca ggc age ttt atg get cct gga ttt caa cag 153 

Pro Asn Pro Tyr Pro Pro Gly Ser Phe Met Ala Pro Gly Phe Gin Gin 
20 25 30 
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cct ctg ggt tea ate aac tta gaa aac caa get cag ggt get cag cgt 201 

Pro Leu Gly Ser lie Asn Leu Glu Asn Gin Ala Gin Gly Ala Gin Arg 
35 40 45 

get cag ccc tac ggc ate aca tct ccg gga ate ttt get age agt caa 24 9 

Ala Gin Pro Tyr Gly lie Thr Ser Pro Gly lie Phe Ala Ser Ser Gin 

50 SS 60 

ccg ggt caa gga aat ata caa atg ata aat cca agt gtg gga aca gca 2 97 

Pro Gly Gin Gly Asn He Gin Met He Asn Pro Ser Val Gly Thr Ala 

65 70 75 80 

gta atg aac ttt aaa gaa gaa gca aag gca eta ggg gtg ate cag ate 345 

Val Met Asn Phe Lys Glu Glu Ala Lys Ala Leu Gly Val He Gin He 

85 90 95 

atg gtt gga ttg atg cac att ggt ttt gga att gtt ttg tgt tta ata 393 

Met Val Gly Leu Met His He Gly Phe Gly He Val Leu Cys Leu He 

100 105 110 

tec ttc tct ttt aga gaa gta tta ggt ttt gee tct act get gtt att 441 

Ser Phe Ser Phe Arg Glu Val Leu Gly Phe Ala Ser Thr Ala Val He 
115 120 125 

ggt gga tac cca ttc tgg ggt ggc ctt tct ttt att ate tct ggc tct 489 

Gly Gly Tyr Pro Phe Trp Gly Gly Leu Ser Phe He He Ser Gly Ser 

130 135 140 



etc tct gtg tea gca tec aag gag ctt tec cgt tgt ctg gtg aaa ggc 53 7 

Leu Ser Val Ser Ala Ser Lys Glu Leu Ser Arg Cys Leu Val Lys Gly 

145 150 155 160 

age ctg gga atg aac att gtt agt tct ate ttg gee ttc att gga gtg 585 

Ser Leu Gly Met Asn He Val Ser Ser He Leu Ala Phe He Gly Val 
165 170 175 

att ctg ctg ctg gtg gat atg tgc ate aat ggg gta get ggc caa gac 633 

He Leu Leu Leu Val Asp Met Cys lie Asn Gly Val Ala Gly Gin Asp 

180 185 190 

tac tgg gec gtg ctt tct gga aaa ggc att tea gee acg ctg atg ate 681 

Tyr Trp Ala Val Leu Ser Gly Lys Gly He Ser Ala Thr Leu Met He 
195 200 205 

ttc tec etc ttg gag ttc ttc gta get tgt gee aca gee cat ttt gee 729 

Phe Ser Leu Leu Glu Phe Phe Val Ala Cys Ala Thr Ala His Phe Ala 
210 215 220 

aac caa gca aac ace aca ace aat atg tct gtc ctg gtt att cca aat 777 

Asn Gin Ala Asn Thr Thr Thr Asn Met Ser Val Leu Val He Pro Asn 

225 230 235 240 

atg tat gaa age aac cct gtg aca cca gcg tct tct tea get cct ccc 825 

Met Tyr Glu Ser Asn Pro Val Thr Pro Ala Ser Ser Ser Ala Pro Pro 
245 250 255 

aga tgc aac aac tac tea get aat gee cct aaa tag taaaagaaaa 871 

Arg Cys Asn Asn Tyr Ser Ala Asn Ala Pro Lys 
260 265 
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aggggtatca gtctaatctc 


atggagaaaa 


actacttgca 


aaaacttctt 


aagaagatgt 


931 


cttttattgt ctacaatgat 


ttctagtctt 


taaaaactgt 


gtttgagatt 


tgtttttagg 


991 


ttggtcgcta atgatggctg 


tatctccctt 


cactgtctct 


tcctacatta 


ccactactac 


1051 


atgctggcaa aggtgaagga 


tcagaggact 


gaaaaatgat 


tctgcaactc 


tcttaaagtt 


1111 


agaaatgttt ctgttcatat 


tactttttcc 


ttaataaaat 


gtcattagaa 


acaaaaaaaa 


1171 


aaaaaaaaaa a 










1182 



<210> 46 

<211> 267 

<212> PRT 

<213> Homo sapiens 

<400> 46 

Met Met Ser Ser Lys Pro Thr Ser His Ala Glu Val Asn Glu Thr lie 
15 10 15 



Pro Asn Pro Tyr Pro Pro Gly Ser Phe Met Ala Pro Gly Phe Gin Gin 
20 25 30 



Pro Leu Gly Ser lie Asn Leu Glu Asn Gin Ala Gin Gly Ala Gin Arg 
35 40 45 



Ala Gin Pro Tyr Gly lie Thr Ser Pro Gly lie Phe Ala Ser Ser Gin 
50 55 60 



Pro Gly Gin Gly Asn lie Gin Met lie Asn Pro Ser Val Gly Thr Ala 
65 70 75 80 



Val Met Asn Phe Lys Glu Glu Ala Lys Ala Leu Gly Val He Gin He 
85 90 95 



Met Val Gly Leu Met His He Gly Phe Gly He Val Leu Cys Leu He 
100 105 110 



Ser Phe Ser Phe Arg Glu Val Leu Gly Phe Ala Ser Thr Ala Val He 
115 120 125 



Gly Gly Tyr Pro Phe Trp Gly Gly Leu Ser Phe He He Ser Gly Ser 
130 135 140 
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Leu Ser Val Ser Ala Ser Lys Glu Leu Ser Arg Cys Leu Val Lys Gly 
145 150 155 160 



Ser Leu Gly Met Asn lie Val Ser Ser lie Leu Ala Phe lie Gly Val 
165 170 175 



lie Leu Leu Leu Val Asp Met Cys lie Asn Gly Val Ala Gly Gin Asp 
180 . 185 190 



Tyr Trp Ala Val Leu Ser Gly Lys Gly He Ser Ala Thr Leu Met He 
195 200 205 



Phe Ser Leu Leu Glu Phe Phe Val Ala Cys Ala Thr Ala His Phe Ala 
210 215 220 



Asn Gin Ala Asn Thr Thr Thr Asn Met Ser Val Leu Val He Pro Asn 
225 230 235 240 



Met Tyr Glu Ser Asn Pro Val Thr Pro Ala Ser Ser Ser Ala Pro Pro 
245 250 255 



Arg Cys Asn Asn Tyr Ser Ala Asn Ala Pro Lys 
260 265 



<210> 47 

<211> 1388 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDNA 

<222> (1) . . (1388) 

<220> 

<221> CDS 

<222> (178) . . (1053) 
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<300> 



<301> Tedder, TF, Klejman, G, Disteche, CM, Adler, DA, Schlossman, SF, 
.Saito, H 

<302> Cloning of complementary DNA encoding a new mouse B lymphocyte 
differentiation antigen, homologous to the human Bl (CD20) antigen and 
localization of the gen to chromosome 19 

<303> J Immunol 

<304> 141 

<305> 12 



<306> 4388-4394 



<307> 1988- - 

<309> - - 



<400> 47 

gaattccttt tttttttttt tttttaaaga tttatttatt attatatgta agtacactgt ' 60 

agctatcttc aagtacttga gatagaagag gccaactgat ctcagctgtg agtggctaat 120 

ttggccctta agccttggag ccttggagcc ttggagaccc aggcgtttga aaactca 177 

atg agt gga cct ttc cca gca gag cct aca aaa ggt ccc etc gec atg 225 

Met Ser Gly Pro Phe Pro Ala Glu Pro Thr Lys Gly Pro Leu Ala Met 
15 10 15 



caa cct get cca aaa gtg aac etc aaa agg aca tct tea ctg gtg ggc 
Gin Pro Ala Pro Lys Val Asn Leu Lys Arg Thr Ser Ser Leu Val Gly 
20 25 30 



273 



ccc aca caa age ttc ttc atg agg gaa tea aag get ttg ggg get gtc 
Pro Thr Gin Ser Phe Phe Met Arg Glu Ser Lys Ala Leu Gly Ala Val 
35 40 45 



321 



caa ate atg aat ggc etc ttc cat att acc ctg ggg gga ctg ctg atg 
Gin lie Met Asn Gly Leu Phe His lie Thr Leu Gly Gly Leu Leu Met 
50 55 60 



369 



ate ccc aca ggg gtc ttc gca ccc ate tgt ttg agt gta tgg tac cct 
lie Pro Thr Gly Val Phe Ala Pro lie Cys Leu Ser Val Trp Tyr Pro 
65 70 75 80 



417 



etc tgg gga ggc att atg tac att att tea gga tea etc ctg gca get 
Leu Trp Gly Gly lie Met Tyr He He Ser Gly Ser Leu Leu Ala Ala 
85 90 95 



465 



gca gca gaa aaa acc tec agg aag agt ttg gtc aaa gca aaa gtg ata 
Ala Ala Glu Lys Thr Ser Arg Lys Ser Leu Val Lys Ala Lys Val He 
100 105 110 



513 



atg age tct eta age etc ttt get gee att tct gga ata att ctt tea 
Met Ser Ser Leu Ser Leu Phe Ala Ala He Ser Gly He He Leu Ser 
115 120 125 



561 
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ate atg gac ata ctt aac atg aea ctt tct cat ttt tta aaa atg aga 609 
lie Met Asp lie Leu Asn Met Thr Leu Ser His Phe Leu Lys Met Arg 
130 135 140 

aga ctg gag ctt att caa act tec aag ccg tat gtt gat ate tac gac 657 
Arg Leu Glu Leu lie Gin Thr Ser Lys Pro Tyr Val Asp lie Tyr Asp 
145 150 155 160 

tgt gaa cca tct aat tec tea gag aaa aac tec cca tct aca cag tac 705 
Cys Glu Pro Ser Asn Ser Ser Glu Lys Asn Ser Pro Ser Thr Gin Tyr 
165 170 175 

tgt aac age att cag tct gtg ttc ttg ggc att ctg teg gcg atg ctg 753 
Cys Asn Ser lie Gin Ser Val Phe Leu Gly lie Leu Ser Ala Met Leu 
180 185 190 

ate tct gee ttc ttc cag aaa ctt gtg aca get ggt att gtg gag aat 801 
He Ser Ala Phe Phe Gin Lys Leu Val Thr Ala Gly He Val Glu Asn 
195 200 205 

gag tgg aaa aga atg tgt acc aga tec aaa tct aat gtg gtt ctg ctg 84 9 

Glu Trp Lys Arg Met Cys Thr Arg Ser Lys Ser Asn Val Val Leu Leu 
210 215 220 

tea get gga gaa aaa aat gag cag acg att aaa atg aaa gaa gaa ate 897 
Ser Ala Gly Glu Lys Asn Glu Gin Thr He Lys Met Lys Glu Glu He 
225 230 235 240 

att gag eta agt gga gta tct tec caa cca aag aat gaa gag gaa att 945 
He Glu Leu Ser Gly Val Ser Ser Gin Pro Lys Asn Glu Glu Glu He 
245 250 255 

gaa att att cca gtg cag gag gaa gaa gaa gaa gaa gca gaa ata aat 993 
Glu He He Pro Val Gin Glu Glu Glu Glu Glu Glu Ala Glu He Asn 
260 265 270 

ttt cca gca cct ccc caa gag cag gaa tec ttg cca gtg gaa aat gag 1041 
Phe Pro Ala Pro Pro Gin Glu Gin Glu Ser Leu Pro Val Glu Asn Glu 
275 280 285 

ate get cct taa actcttttct tttctaagca ttattgttta gagagcttcc 1093 
He Ala Pro 
290 

aagacacata gttaccctca tctcttgtgg ccttccacaa tctattctcc atattttcac 1153 

agcttaactt tgcatagaga agccacatct agctctcctt cacatttgaa gaatgcagtg 1213 

attataaaag attgtctttt gecttgetta gggagtctta cactggcaga aacgetgaag 1273 

aatccaattc tcattcacct tttccttgga tgtgtgtctc agtagtggta atggtttttc 1333 

cgcatttcct ccatcagcag ttacagcaga aggagtcaga gagttcaggg aattc 1388 



<210> 48 
<211> 291 
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<212> PRT 

<213> Mus musculus 

<400> 48 

Met Ser Gly Pro Phe Pro Ala Glu Pro Thr Lys Gly Pro Leu Ala Met 
15 10 15 



Gin Pro Ala Pro Lys Val Asn Leu Lys Arg Thr Ser Ser Leu Val Gly 
20 25 30 



Pro Thr Gin Ser Phe Phe Met Arg Glu Ser Lys Ala Leu Gly Ala Val 
35 40 45 



Gin lie Met Asn Gly Leu Phe His lie Thr Leu Gly Gly Leu Leu Met 
50 55 60 



lie Pro Thr Gly Val Phe Ala Pro lie Cys Leu Ser Val Trp Tyr Pro 
65 70 75 80 



Leu Trp Gly Gly lie Met Tyr He He Ser Gly Ser Leu Leu Ala Ala 
85 90 95 



Ala Ala Glu Lys Thr Ser Arg Lys Ser Leu Val Lys Ala Lys Val He 
100 105 110 



Met Ser Ser Leu Ser Leu Phe Ala Ala lie Ser Gly He He Leu Ser 
115 120 125 



He Met Asp He Leu Asn Met Thr Leu Ser His Phe Leu Lys Met Arg 
130 135 140 



Arg Leu Glu Leu He Gin Thr Ser Lys Pro Tyr Val Asp He Tyr Asp 
145 150 155 160 



Cys Glu Pro Ser Asn Ser Ser Glu Lys Asn Ser Pro Ser Thr Gin Tyr 
165 170 175 



Cys Asn Ser He Gin Ser Val Phe Leu Gly He Leu Ser Ala Met Leu 
180 185 190 



He Ser Ala Phe Phe Gin Lys Leu Val Thr Ala Gly He Val Glu Asn 
195 200 205 



Glu Trp Lys Arg Met Cys Thr Arg Ser Lys Ser Asn Val Val Leu Leu 
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210 215 220 



Ser Ala Gly Glu Lys Asn Glu Gin Thr lie Lys Met Lys Glu Glu He 
225 230 235 240 



He Glu Leu Ser Gly Val Ser Ser Gin Pro Lys Asn Glu Glu Glu He 
245 250 255 



Glu He He Pro Val Gin Glu Glu Glu Glu Glu Glu Ala Glu He Asn 
260 265 270 



Phe Pro Ala Pro Pro Gin Glu Gin Glu Ser Leu Pro Val Glu Asn Glu 
275 280 285 



He Ala Pro 
290 



<210> 49 

<211> 1010 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDNA 

<222> (1) . . (1010) 

<220> 

<221> CDS 

<222> (23) . . (730) 

<300> 

<30l> Ra, C, Jouvin, M-He, Kinet, J-P 

<302> Complete structure of the mouse mast cell receptor for IgE 
(FceRI) and surface expression of chimeric receptors (rat -mouse -human) 
on transfected cells 

<303> J Biol Chem 

<304> 264 
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<305> 26 

<306> 15323-15327 

<307> 1989- - 

<309> - - 



<400> 49 

ggatagccca attaatgaaa aa atg gac aca gaa aat agg age aga gca gat 52 

Met Asp Thr Glu Asn Arg Ser Arg Ala Asp 
15 10 



ctt get etc cca aat cca caa gaa tec tec agt gca cct gac att gaa 100 
Leu Ala Leu Pro Asn Pro Gin Glu Ser Ser Ser Ala Pro Asp lie Glu 
15 20 25 

etc ttg gaa gca tct cct gee aaa gca gee cca cca aag cag aca tgg 14 8 

Leu Leu Glu Ala Ser Pro Ala Lys Ala Ala Pro Pro Lys Gin Thr Trp 
30 35 40 

egg aca ttt ttg aag aaa gag ttg gag ttc ctg gga gca aca caa att 196 
Arg Thr Phe Leu Lys Lys Glu Leu Glu Phe Leu Gly Ala Thr Gin lie 
45 50 55 

ctg gtt ggt ttg ata tgc ctt tgt ttt gga aca att gtc tgc tec gta 244 
Leu Val Gly Leu lie Cys Leu Cys Phe Gly Thr lie Val Cys Ser Val 
60 65 70 

etc tat gtt tea gac ttt gat gaa gaa gtg ctt tta ctt tat aaa eta 292 
Leu Tyr Val Ser Asp Phe Asp Glu Glu Val Leu Leu Leu Tyr Lys Leu 
75 80 85 90 

ggc tat cca ttc tgg ggt gca gtg ctg ttt gtt ttg tct gga ttt ttg 340 
Gly Tyr Pro Phe Trp Gly Ala Val Leu Phe Val Leu Ser Gly Phe Leu 
95 100 105 

tea att ate tec gaa aga aaa aac aca ttg tat ctg gtg aga ggc age 3 88 

Ser lie lie Ser Glu Arg Lys Asn Thr Leu Tyr Leu Val Arg Gly Ser 
110 115 120 

ctg gga gca aac att gtc agt age ate get gca ggg acg ggg ate gee 436 
Leu Gly Ala Asn lie Val Ser Ser He Ala Ala Gly Thr Gly He Ala 
125 130 135 

atg ctg ate etc aat ctg ace aat aac ttc get tat atg aac aac tgc 484 
Met Leu He Leu Asn Leu Thr Asn Asn Phe Ala Tyr Met Asn Asn Cys 
140 145 150 

aag aat gta ace gaa gac gac ggc tgc ttt gtg get tct ttc ace aca 532 
Lys Asn Val Thr Glu Asp Asp Gly Cys Phe Val Ala Ser Phe Thr Thr 
155 160 165 170 

gaa ctg gtg ttg atg atg ctg ttt etc ace ate ctg gec ttt tgc agt 580 
Glu Leu Val Leu Met Met Leu Phe Leu Thr He Leu Ala Phe Cys Ser 
175 180 185 

get gtg ttg ttc act ate tat agg att gga caa gag tta gaa agt aaa 628 
Ala Val Leu Phe Thr He Tyr Arg He Gly Gin Glu Leu Glu Ser Lys 
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190 195 200 

aag gtc cca gat gat cgt ctt tat gaa gaa tta aat gtg tat tea cca 676 
Lys Val Pro Asp Asp Arg Leu Tyr Glu Glu Leu Asn Val Tyr Ser Pro 
205 210 215 

att tac agt gag ttg gaa gac aaa ggg gaa aca tct tct cca gtt gat 724 
lie Tyr Ser Glu Leu Glu Asp Lys Gly Glu Thr Ser Ser Pro Val Asp 
220 225 230 

tea taa gaatcagggg accaggacaa tctgattcaa gtataatctt gaaagttgat 780 

Ser 

235 

ctttttacaa aattctcgea aaatttctgt ttgttccaca ttctgtcagt ttttcaattg 840 

gattgttctg cagatgccac tcttttagtt atgctgtatc tgatcttcta aatatctccc 900 

tttttgcgga tatcattcac tccaattttc ttgttttgtg tcacaatttc acatacatct 960 

tttctggaaa gtcatcaagg aataagttgg ctttattgta tgtctacttt 1010 

<210> 50 

<211> 235 

<212> PRT 

<213> Mus musculus 



<400> 50 

Met Asp Thr Glu Asn Arg Ser Arg Ala Asp Leu Ala Leu Pro Asn Pro 
1 5 10 15 



Gin Glu Ser Ser Ser Ala Pro Asp lie Glu Leu Leu Glu Ala Ser Pro 
20 25 30 



Ala Lys Ala Ala Pro Pro Lys Gin Thr Trp Arg Thr Phe Leu Lys Lys 
35 40 45 



Glu Leu Glu Phe Leu Gly Ala Thr Gin lie Leu Val Gly Leu lie Cys 
50 55 60 



Leu Cys Phe Gly Thr lie Val Cys Ser Val Leu Tyr Val Ser Asp Phe 
65 70 75 80 



Asp Glu Glu Val Leu Leu Leu Tyr Lys Leu Gly Tyr Pro Phe Trp Gly 
85 90 95 



Ala Val Leu Phe Val Leu Ser Gly Phe Leu Ser lie lie Ser Glu Arg 
100 105 110 
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Lys Asn Thr Leu Tyr Leu Val Arg Gly Ser Leu Gly Ala Asn He Val 
115 120 125 

Ser Ser He Ala Ala Gly Thr Gly lie Ala Met Leu He Leu Asn Leu 
130 135 140 

Thr Asn Asn Phe Ala Tyr Met Asn Asn Cys Lys Asn Val Thr Glu Asp 
145 150 155 160 

Asp Gly Cys Phe Val Ala Ser Phe Thr Thr Glu Leu Val Leu Met Met 
165 170 175 

Leu Phe Leu Thr He Leu Ala Phe Cys Ser Ala Val Leu Phe Thr He 
180 185 190 

Tyr Arg He Gly Gin Glu Leu Glu Ser Lys Lys Val Pro Asp Asp Arg 
195 200 205 

Leu Tyr Glu Glu Leu Asn Val Tyr Ser Pro lie Tyr Ser Glu Leu Glu 
210 215 220 

Asp Lys Gly Glu Thr Ser Ser Pro Val Asp Ser 
225 230 235 

<210> 51 

<211> 2542 

<212> DNA 

<213> rat 

<220> 

<221> cDNA 

<222> (1) . . (2542) 



<400> 51 
acgtttctgt 


gtaacaatat 


cttttattcc 


tggatagtcc 


aattaatgaa 


aaaaatggac 


60 


acagaaaata 


agagcagagc 


agatcttgct 


ctcccaaacc 


cacaagaatc 


ccccagcgca 


120 


cctgacattg 


aactcttgga 


agcgtcccct 


cctgcaaaag 


ctctaccaga 


gaagccagcc 


180 


tcacccccac 


cacagcagac 


atggcagtca 


tttttgaaga 


aagagttgga gttcctgggc 


240 
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gtaacccaag ttctggttgg tttgatatgc 
ctccagactt cagactttga cgacgaagtg 

5 

tggggtgcag tgctgtttgt tttgtctgga 
acactgtatc tggtgagagg cagcctggga 
10 ttggggatcg ccatattgat tctcaatctg 
aaggatataa ccgaagacga tggttgcttc 
atgttgctgt ttctcaccat cctggccttt 

15 

attggacaag aatttgagcg tagtaaggtc 
gtgtattcac caatttacag tgcgttggaa 
20 tcataagaat caaggggcca ggacaatctg 
ttgcaacatt atggcaacat ttctgtttcc 
gttctgtaga tacccctgtt tcagttatga 

25 

gtggatatcg ttcaatccag ttttcttgtt 
ggaaagtcat caaaaacaag ctagctttta 
30 ggaggagtta ttttgagagt ttaactaaac 
ttcattttag aattctcaac aatacttgtg 
tatatatatt taatatggta tttacttttc 

35 

aatgggtttt tcagagacat aaagdtttat 
atattcacta atacagttgt tgctcagtgg 
40 gagaacatta ttaaatcatt cagtgtagtg 
agtgtgtata tcttttggta agacatgtgt 
agctggaaag tagtcaatgg gaatgacaag 

45 

taactaccaa caattggtag agattgccat 
gcctctgtta cacagtgagt tagaggccag 
50 acgaaagaag atgttggttc aaactgttaa 
tttgactgat atattgagtg cttgagtttt 
gaggggagca gtgatgcaga agtctggatg 

55 

tgaaacgtat ttccagactt ttgttctgag 
agaaggtcaa gacaccaaaa gaattatttt 
60 tgcctatact tagaaggtga agtaaggatc 



ctttgttttg 


gaacagttgt 


ctgctccaca 


300 


cttttattat 


atagagcagg 


ctacccattc 


360 


tttttgtcaa 


ttatgtccga 


aaggaaaaac 


420 


gcaaacattg 


tcagcagcat 


cgctgcaggc 


480 


agcaacaact 


ccgcttatat 


gaactactgc 


540 


gtgacttctt 


tcatcacaga 


actggtgttg 


600 


tgcagtgccg 


tgctgctcat 


tatctatagg 


660 


cccgatgacc 


gtctctatga 


agaattacat 


720 


gacacaaggg 


aagcgtccgc 


accagtggtt 


780 


attccagtat 


agtcttgaga 


gtcgatcttt 


840 


tccgcactct 


atcaactttt 


caattggatt 


900 


tgcctctggt 


ctttaattat 


ctcccttttt 


960 


ttgtgtcaca 


gtctcacata 


caacctttct 


1020 


ttgcatgtct 


actttcatga 


acaaaaggaa 


1080 


ttagataatc 


aggtaatatt 


tgactcttag 


1140 


catgatatat 


gcccaccata 


tcaagccttc 


1200 


tatgtagata 


gattttccac 


cctcaataat 


1260 


gaaaagacac 


atattatcta 


attcatgggt 


1320 


tgtttactac 


ttggtgggta 


gtaggtaata 


1380 


agatgcatag 


gtaaaatcag 


ggacactgtg 


1440 


gaaaatgaag 


aataaactga 


tgaagacttg 


1500 


aaatgattgt 


gtataacact 


tgtagataaa 


1560 


gtatgcctaa 


aatctcccag 


cccaaggcca 


1620 


tctgggctac 


acaagatcat 


acatcaaagg 


1680 


cacagtaagg 


gatatttaaa 


caaacagaag 


1740 


taataaaact 


gaatgaataa 


cattgcgggg 


1800 


atggaggagt 


agcagaatca 


gatgaaacat 


1860 


atggttataa 


gagcaatcac 


cattaaatga 


1920 


gagatagaat 


taagacagtc 


aaaatccaca 


1980 


aaaagtagaa 


agcctaacga 


ttagttggaa 


2040 
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aagcatatta 


cgttaggcag 


cagatgtcta 


tagtggagaa 


aagttaaaca 


aggagaaata 


2100 


atgaaccacc 


agagactcta 


catgttggtt 


tgggaaataa gagaaaatag 


caattctaaa 


2160 


cgaatgcaaa 


ctctgaagaa 


gcatttccca 


aagggtgtgg 


gcagaggacc 


agaacatttg 


2220 


caaatgtacc 


tagagagcaa 


acctgaatag gaggtaaaat 


gggggaaaag 


cagctaagaa 


2280 


aatgattttg 


ttgctgttat 


ttagatttta 


aaagaaacaa 


aaagagtcat 


taaaaatctg 


2340 






tgttctctgt 


gtatgtccaa 


agtacaggta 


art t* ttr t* aa 


2400 


atcttcctgt 


aaggctcacc 


tcatatgtct 


cttcacatag 


ccacaccctt 


gattcacagt 


2460 


tactctacca 


cagtagtaaa 


ctgtgcttgt 


ggtctcctta 


tgtatcttcc 


actagtgttt 


2520 


ataaaataaa 


tcagaattat 


tt 








2542 



<210> 52 

<211> 243 

<212> PRT 

<213> rat 

<400> 52 

Met Asp Thr Glu Asn Lys Ser Arg Ala Asp Leu Ala Leu Pro Asn Pro 
15 10 15 

Gin Glu Ser Pro Ser Ala Pro Asp lie Glu Leu Leu Glu Ala Ser Pro 
20 25 30 

Pro Ala Lys Ala Leu Pro Glu Lys Pro Ala Ser Pro Pro Pro Gin Gin 
35 40 45 

Thr Trp Gin Ser Phe Leu Lys Lys Glu Leu Glu Phe Leu Gly Val Thr 
50 55 60 

Gin Val Leu Val Gly Leu He Cys Leu Cys Phe Gly Thr Val Val Cys 
65 70 75 80 

Ser Thr Leu Gin Thr Ser Asp Phe Asp Asp Glu Val Leu Leu Leu Tyr 
85 90 95 

Arg Ala Gly Tyr Pro Phe Trp Gly Ala Val Leu Phe Val Leu Ser Gly 
100 105 110 

Phe Leu Ser He Met Ser Glu Arg Lys Asn Thr Leu Tyr Leu Val Arg 
115 120 125 

Gly Ser Leu Gly Ala Asn He Val Ser Ser He Ala Ala Gly Leu Gly 
130 135 140 

lie Ala He Leu He Leu Asn Leu Ser Asn Asn Ser Ala Tyr Met Asn 
145 150 155 160 
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Tyr Cys Lys Asp He Thr Glu Asp 
165 

He Thr Glu Leu Val Leu Met Leu 

5 180 

Cys Ser Ala Val Leu Leu He He 
195 200 

10 Arg Ser Lys Val Pro Asp Asp Arg 
210 215 



Asp Gly Cys Phe Val Thr Ser Phe 
170 175 

Leu Phe Leu Thr He Leu Ala Phe 
185 190 

Tyr Arg He Gly Gin Glu Phe Glu 
205 

Leu Tyr Glu Glu Leu His Val Tyr 
220 



15 



20 



Ser Pro He Tyr Ser Ala Leu Glu Asp Thr Arg Glu Ala Ser Ala Pro 
225 230 235 240 

Val Val Ser 



<210> 53 
<211> 1323 



<212> DNA 
25 <213> dog 



<220> 

30 

<221> CDNA 





<222> (1) 


. . (1323) 




35 


<400> 53 








cccagctcac 


tgcaccaacg 


tccttccctg 


40 


ccagaaaaca 


gtggcagggc 


aggccttgct 




cctgaaattg 


aactttcaga 


agtacctctc 


45 


tcatccccac 


cgtgccatac 


gtggctgatg 




gtaacacaaa 


ttctgattgc 


cttggtatgt 




atcaatattt 


cagaatttaa 


ggaagacatt 


50 


tggggagcag 


tgttttttgc 


tatttctgga 




gctacatatc 


tggcgtgggg 


cagcctggga 


55 


ggaatcttta 


tctacggtca 


acctgcagac 


tgccagcaag 


ctcccagaga 


cgacttctgc 




gcaatgatcc 


tgtttctcac 


tattctgggg 


60 


ggagttggag 


aattagtcca 


agggaacaag 



gccgacagct 


cggctaatga 


gaaaatggag 


60 


ctcccgaacc 


cgcagggacc 


ctccagtgca 


120 


cgtgaaaacc 


ccttactgga 


gaaggctggt 


180 


tttctgaaga 


gggagctgga 


atttctgggg 


240 


ctttgttttg 


gaataattgt 


ctgctccgtg 


300 


ttttcatcat 


ttaaagcagg 


ctacccattc 


360 


tttttgccaa 


ttatgtctga 


aaagaaacat 


420 


gccaacactg 


tccagatagc 


tgcgggaata 


480 


ctgcagagga 


gctcggctta 


catccacaac 


540 


tttgtggctt 


gtttttccac 


agaaattgtg 


600 


ttttgcagtg 


ctgtgtcact 


gacgatctat 


660 


attgtagaag 


atcgtcttta 


tgaagaatta 


720 
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aacatatatt caccgattta cagtgagctg gaagacaaag cggaggaatc ttctcctgtt 780 

gattcataag aattgtgttc agaacattct cacagaaaag ggtctagaat aagccaagat 84 0 

ctttgttaaa ggggctacag gcaaaatttt gattcttgtc tatgatctgc tagttttaca 900 

ctaggattta ttctccaaat ggtatatgct gcctgttgtt tctgttcacc taaaattccc 960 

ctcccaccac ccacttatgg gtatgtgata gactcctatt ttcttgtttc tattgctctc 1020 

tggtagcact ttatggaagt cacgtgtaat gagcaagcat ttattctgtg cctataactg 1080 

tacaaataga gggaacaagt ggaaggggtt gagagctgag ttaaactttc tgttttggaa 114 0 

cacctgggtg gatccgtcaa ttaagagtct gactcttgac tgcagactca ggtcatgaga 1200 

tcaagcccca tgtcaggctc ctcactcagt ggggagtctg cttgggattc tttctctcct 1260 

tctccctcta tcccttcttc tgctctctgt ctcccaatct cataaataaa taaatataat 1320 

ttt 1323 

<210> 54 

<211> 137 

<212> PRT 

<213> dog 

<400> 54 

Met Asn Ser Gly Arg Ala Gly Ala Asn Gly Ser Ser Ala Ser Val Arg 
1 5 10 15 

Asn Lys Ala Gly Ser Ser Cys His Thr Trp Met Lys Arg Gly Val Thr 
20 25 30 

Ala Val Cys Cys Gly Val Cys Ser Val Asn Ser Lys Asp Ser Ser Lys 
35 40 45 

Ala Gly Tyr Trp Gly Ala Val Ala Ser Gly Met Ser Lys Lys His Ala 
50 55 60 

Thr Tyr Ala Trp Gly Ser Gly Ala Asn Thr Val Ala Ala Gly Gly Tyr 
65 70 75 80 

Gly Ala Asp Arg Ser Ser Ala Tyr His Asn Cys Ala Arg Asp Asp Cys 
85 90 95 

Val Ala Cys Ser Thr Val Ala Met Thr Gly Cys Ser Ala Val Ser Thr 
100 105 110 

Tyr Gly Val Gly Val Gly Asn Lys Val Asp Arg Tyr Asn Tyr Ser Tyr 
115 120 125 

Ser Asp Lys Ala Ser Ser Val Asp Ser 
130 135 
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<210> 55 

<211> 1222 

<212> DNA 

<213> pig 

<220> 

<221> CDNA 

<222> (1) . . (1222) 

<220> 

<221> CDS 

<222> (96).. (827) 

<400> 55 

gtaatatcca ggtaagtatc tcagattagc acagagcttg gagaagagag agacagagag 60 

aggaagccaa gaagtgccaa aggaacagag acaaa atg atg tea tec aag cca 113 

Met Met Ser Ser Lys Pro 
1 5 

aca ace tac cct gga gtg tat gga ace aca cct gat ctt tac cca cca 161 
Thr Thr Tyr Pro Gly Val Tyr Gly Thr Thr Pro Asp Leu Tyr Pro Pro 
10 15 20 

agt aac ttt atg gtg cct ggg tct caa caa ccc cca ggg ttc ate aac 2 09 

Ser Asn Phe Met Val Pro Gly Ser Gin Gin Pro Pro Gly Phe lie Asn 
25 30 35 



cca cga ate caa gtt cag agt age cag 
Pro Arg lie Gin Val Gin Ser Ser Gin 
40 45 

gga ate ccc aat aac agt cag cag gtt 
Gly lie Pro Asn Asn Ser Gin Gin Val 
55 60 

aat cca ggt aca gga aaa gca gee aca 
Asn Pro Gly Thr Gly Lys Ala Ala Thr 
75 

aca eta ggg gca ate cag ate ctg att 
Thr Leu Gly Ala He Gin He Leu He 
90 95 

gga att att ctg ggc tta atg ggc cgc 
Gly He He Leu Gly Leu Met Gly Arg 



gca cct ttc ate gtc tct cca 257 
Ala Pro Phe He Val Ser Pro 
50 

caa gga aat ata caa atg gta 305 
Gin Gly Asn lie Gin Met Val 
65 70 

aac ttt aaa gaa gaa gca aag 353 
Asn Phe Lys Glu Glu Ala Lys 
80 85 

gga tea atg cac att ggc ttc 4 01 

Gly Ser Met His lie Gly Phe 
100 

act tat atg caa gtt ttg ggt 44 9 

Thr Tyr Met Gin Val Leu Gly 
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105 110 115 

ttt get tec ttg get ttt gtt age gga tat cca ttc tgg ggc ggc etc 497 

Phe Ala Ser Leu Ala Phe Val Ser Gly Tyr Pro Phe Trp Gly Gly Leu 

120 125 130 

tct ttc att att act gga att etc tgt att ttg gca tec aag aag tct 545 

Ser Phe lie lie Thr Gly lie Leu Cys lie Leu Ala Ser Lys Lys Ser 

135 140 145 150 



tec cct get ctg ata aaa age age ctg gga atg agt att gtt agt tec 593 
Ser Pro Ala Leu lie Lys Ser Ser Leu Gly Met Ser lie Val Ser Ser 
155 160 . 165 

ttc ttt gee ttc att ggg atg ate ctg ttg ctg gtg gat gaa age ate 641 
Phe Phe Ala Phe He Gly Met He Leu Leu Leu Val Asp Glu Ser He 
170 175 180 

aac ggg ctg ccc gag caa gac tac tgg gcg gtg ctt tct gga aaa ggc 689 
Asn Gly Leu Pro Glu Gin Asp Tyr Trp Ala Val Leu Ser Gly Lys Gly 
185 190 195 

att tea gee atg ctg att ate ttc tct etc ttg gag ttc tgc ata act 737 
He Ser Ala Met Leu He He Phe Ser Leu Leu Glu Phe Cys He Thr 
200 205 210 

tgt gtc aca gee tat ttt gec age cac ace ate acc aac acc agg ggt 785 
Cys Val Thr Ala Tyr Phe Ala Ser His Thr He Thr Asn Thr Arg Gly 
215 220 225 230 

ctg tec tgg tea ttc cac ctg tgt atg caa aca gtc cct tga 827 
Leu Ser Trp Ser Phe His Leu Cys Met Gin Thr Val Pro 
235 240 

cgacaggatt gtcctcagtt cctcccccag ttccggccac ttagctcaca ccacctaata 887 

ccagtaggta ttaaaagaag aagaagcatc ctgeaattte atgggggtag aaccgctctg 947 

cagaaacttc ttaagaagtt ggtcttcgcc gttcacgatg attgetaate tttaaagact 1007 

gtgtttcagg ttctgttctt ggtttgtcat gaatgaaaac tctatggacc tctctaccag 1067 

tgtttcttcc tacatgccac taccacgtgc tggcaaaggt gaagggctgg aggactggaa 1127 

atgactctgc aactctctta aagttagaaa tttcctgtta aagttaccct tccctcaata 1187 

aaatgttact aggagcaaaa aaaaaaaaaa aaaaa 1222 

<210> 56 

<211> 243 

<212> PRT 

<213> pig 



<400> 56 
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Met Met Ser Ser Lys Pro Thr Thr Tyr Pro Gly Val Tyr Gly Thr Thr 
15 10 15 



Pro Asp Leu Tyr Pro Pro Ser Asn Phe Met Val Pro Gly Ser Gin Gin 
20 25 30 



Pro Pro Gly Phe He Asn Pro Arg He Gin Val Gin Ser Ser Gin Ala 
35 40 45 



Pro Phe He Val Ser Pro Gly He Pro Asn Asn Ser Gin Gin Val Gin 
50 55 60 



Gly Asn He Gin Met Val Asn Pro Gly Thr Gly Lys Ala Ala Thr Asn 
65 70 75 80 



Phe Lys Glu Glu Ala Lys Thr Leu Gly Ala He Gin He Leu He Gly 
85 90 95 



Ser Met His He Gly Phe Gly He lie Leu Gly Leu Met Gly Arg Thr 
100 105 110 



Tyr Met Gin Val Leu Gly Phe Ala Ser Leu Ala Phe Val Ser Gly Tyr 
115 120 125 



Pro Phe Trp Gly Gly Leu Ser Phe He He Thr Gly He Leu Cys He 
130 135 140 



Leu Ala Ser Lys Lys Ser Ser Pro Ala Leu He Lys Ser Ser Leu Gly 
145 150 155 160 



Met Ser lie Val Ser Ser Phe Phe Ala Phe He Gly Met lie Leu Leu 
165 170 175 



Leu Val Asp Glu Ser He Asn Gly Leu Pro Glu Gin Asp Tyr Trp Ala 
180 185 190 



Val Leu Ser Gly Lys Gly lie Ser Ala Met Leu He lie Phe Ser Leu 
195 200 205 



Leu Glu Phe Cys lie Thr Cys Val Thr Ala Tyr Phe Ala Ser His Thr 
210 215 220 



lie Thr Asn Thr Arg Gly Leu Ser Trp Ser Phe His Leu Cys Met Gin 
225 230 235 240 
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<210> 57 

<211> 6 

<212> PRT 

<213> Homo sapiens, Mus Musculus . 



<220> 

<221> conserved_motif 

<222> (1) . . (6) 

<223> X is any amino acid 

<400> 57 



Leu Gly Ala Xaa Gin lie 
1 5 



<210> 58 

<211> 4 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 

<400> 58 

Leu Ser Leu Gly 
1 

<210> 59 

<211> 6 

<212> PRT 

<213> Homo sapiens, Mus musculus 



<220> 

<221> misc_f eature 

<223> X is any amino acid 
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<400> 59 

Ser Gly Xaa Leu Ser lie 
1 5 



<210> 60 

<211> 6 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 

<400> 60 

Gly Tyr Pro Phe Trp Gly 
1 5 

<210> 61 

<211> 8 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 

<400> 61 

Phe lie lie Ser Gly Ser Leu Ser 
1 5 

<210> 62 

<211> 15 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 
<220> 

<221> conserved_motif 

<222> (1) . . (15) 

<223> X is any amino acid 
<400> 62 

Ser Leu Xaa Xaa Asn Xaa Xaa Ser Xaa Xaa Xaa Ala Xaa Xaa Gly 
15 10 15 
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<210> 63 

<211> 21 

<212> DNA 

<213> Mus musculus 

<400> 63 

cacgaggcac acaagcaaag c 21 

<210> 64 

<211> 25 

<212> DNA 

<213> Mus musculus 



<210> 65 

<211> 24 

<212> DNA 

<213> Mus musculus 

<400> 65 

tgggtgagaa cacacaatca aaac 24 

<210> 66 

<211> 24 

<212> DNA 

<213> Mus musculus 



<400> 64 

aagtgcttga cttacatact tacag 



25 



<400> 66 

cacatacaca caagagaatt agac 



24 



<210> 



67 



<211> 24 
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<212> DNA 

<213> Mus musculus 

<400> 67 

ccattgtctg tactgtttct gctg 

<210> 68 

<211> 23 

<212> DNA 

<213> Mus musculus 

<400> 68 

gccaaatgca tacacatgtg cac 

<210> 69 

<211> 24 

<212> DNA 

<213> Mus musculus 

<400> 69 

ctggaagtga ctgggtgaca aggc 

<210> 70 

<211> 22 

<212> DNA 

<213> Mus musculus 

<400> 70 

caatcgttcc tcatatgcac ag 

<210> 71 

<211> 25 

<212> DNA 

<213> Mus musculus 
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24 



23 



24 
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<400> 71 

agactctggt ggtcattact gtctc 25 



<210> 72 

<211> 22 

<212> DNA 

<213> Mus musculus 



<400> 72 

gaatgccaaa tgcacagaaa gg 22 

<210> 73 

<211> 12952 

<212> DNA 

<213> homo sapiens 



<220> 

<221> genomic_DNA 

<222> (1) . . (12952) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding_region 

<222> (4390) . . (4548) 



<220> 

<221> coding_region 
<222> (5017) . . (5136) 



<220> 

< 2 2 1 > codi ng_r eg i on 
<222> (6303) . . (6359) 
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<220> 
5 <221> 
<222> 

10 

<220> 
<221> 
15 <222> 



coding_region 
(7813) . . (8049) 

coding__region 
(8851) . . (8952) 



<220> 

20 

<22 1> coding_region 
<222> (10142) . . (10357) 

25 



<400> 73 





tatctcatat 


tctcaaaatg 


atactttgaa 


30 


cagagagacc 


acaagaggag 


caagcaagag 




gccagaaatt 


gggagactga 


aggtgcaaag 


35 


ttggttggac 


tgagatgggc 


tgaaaagcac 




cggagatagg 


tgcacaggag 


gccagaaggg 




agaatgaaga 


tgaggtaaca 


acagtgaggc 


40 


cgagtggact 


taaaagtcta 


ttcaaaccaa 




tggatggaca 


tgaaaacagg 


ggtgccttta 


45 


tgagtaatta 


aatgcggtac 


tgagtgatca 




tttggacacc 


agcaaatgta 


aaatcttatt 




acccctacta 


atctaccaac 


tagtctttga 


50 


gtgaacaatt 


tattttaatg 


ataaggcaaa 




gttttatgta 


tttatgaatt 


ctcatggaac 


55 


gccataaatc 


gattaatata 


tttacctctc 




tatatatatg 


aaagatagtt 


tccataagaa 




tcttttataa 


actactattt 


tatacaaata 


60 


caccatgaga 


taccatttta 


acagagaaat 



atggggaagg 


taactattac 


cttaaactta 


60 


agctgggact 


aactctgtct 


gctacatatt 


120 


aagaaataat 


ttcttggaag 


ggctaacagc 


180 


agagggaagg gaagtaagag 


cactcttcct 


240 


ataaaaatgc 


aaaactattt 


tgagtggaag 


300 


agtttttaaa 


cttctcagta 


catgatggta 


360 


aacccataaa 


ggctgtgaga 


tcttattcag 


420 


aaaataaaaa 


aatatttgca 


ggagttagtg 


480 


cctgagtctc 


tattttaaca 


ttcatgctga 


540 


gttatcatat 


tgcggctaat 


gtggtccata 


600 


gcataactgt gcattgggca 


ctaagtcatt 


660 


attgcgagaa 


tctccaatct 


ccgtaggcga 


720 


gaaagctttt 


cattacagtc 


agcccacaat 


780 


atatatgaga 


cagatatata 


tataatcaga 


840 


ataaaaaaac 


agagaaagca 


ctcatgtgtt 


900 


tttgtttaag gtttgtagaa 


gtatagttgc 


960 


atcacacaca 


cacacacaca 


gccctcatgg 


1020 
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tcattattgc cattagttac aacaggcata 
tatctcgcca taatacagac acagctttaa 
5 aacaatggca gtccaatcat ctccttcagt 
ttatcttcac taaagttacc cctctggggc 
cttgccatag gaccttggga aaaataacct 
agtagtcaaa atgcctactt acgaaggtgt 
ctctagtata tagtaggtgc tcagtaaaca 

15 agcagaaaaa gcatcaacat cagagccatg 
aactgtgagg tctttactaa ggcaaattct 
aatgaacaga atggtttctg aggtttggac 
atcagtccgc aaactcttta gtgggctata 
tagagtggct tgttgtacaa agcagatccc 

25 caggggcaaa atcaagtgag aaacaaccaa 
gccagacaga taatactttt caactgattt 
ggagaaaaga ggggtcataa gggaaaggct 

30 

gcctggtacc ctggatgcct aatttgagag 
ctgggcatgg tggctcacag ctgttatcct 
35 gcttgagtcc aggagttcaa gaccagcctg 
aaatataaaa attagctgag tatggtggtg 
tgaggcagaa ggatcacctg agcccaggag 

40 

actgcactgc agcctgggcg acagagcgag 
aaaatattta ctaattatgt aacttggcac 
45 cctcttctat aaaacacagg tgatgattca 
gatgccatgt gtcaaattta cttccagccc 
ctgcctgttt tacagcacac ctgcagctac 

50 

ttcaggctgt ggtctaatgg ctgggatgtg 
atagacagtc ttgatttcca agacatctta 
55 aagatgtgtc cccaaaagag ggattttcag 
aagtggttaa catgaacaag tgaatgctaa 
ttaaaatgtc tgagctggaa agggccgtat 

60 

atcatgaaaa ctgataaata gcatttacta 
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gcccaagtag aattggctta caaaaaagtt 1080 

tcaagcagtt gatgaataaa atgaaattaa 114 0 

tcaaacaatt gatgaccttt gcattttaag 12 00 

ggagtgtagt gactcccagc tcattccata 1260 

ctctcaacct ccatttcttc atctgcagaa 1320 

tgagtgagtt ggcaatacca tatacaatat 1380 

actataatta tctggactca cctatggtac 1440 

gtgagtgcct tctagcttgg ctctgttgct 1500 

atacccagac tgcagcttcc tcatttgtga 1560 

tagctctaga aatcctacct agtgaccaga 1620 

ttaaaatatc tgtttttgtt agaggtcttg 1680 

caggaccatg tggcatctgg ctatctaggg 174 0 

agtttcttct ctctttcctc tagcccatgc 1800 

atagtcattc acatagtgca tatattggaa 1860 

gaattcctat ttctcacagg agacaaagca 192 0 

atccaagtct tcattttaaa tctttacagg 1980 

agcactttgg gaggtggagg ggggcagatt 204 0 

ggcaacatgg caagacccca tctctaaaaa 2100 

gttgcccgtg gtcccagcta ctccagtggc 2160 

gtagaggctg cagtgagctg tgatcatgac 222 0 

gccctgtcat gaaaacaaac aaacaaacaa 2280 

aagtaaccta gcttctccgt gcccttgttt 2340 

ccccttgcct gttctataag gctcagatga 2400 

taaagtaggt ataaatgtga cagtagtcag 24 60 

tcagctctga ttgacttcag taagatatac 2520 

g|caccggctt cagtaagata tgtgacctgg 2580 

catgacttac aagcaataaa gacaagagca 2640 

attccaggaa agacccagtg gagtccgcat 2700 

ggaaacccaa gatgaagatt gaacagaatc 2760 

taattgcctt ctcacttcag aaatgaataa 2820 

cactctccct ttgtcctagg cacaattaca 2880 
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tatgttaact 


tatttatgtc 


ttacatctca 


gagaggggta 


ctgttctaac 


tttacagaag 


2940 


5 


gataaaatcg 


aaactaatgc 


tcagcaaagt 


acaaagaaca 


agaatagcaa 


caaaaataac 


3000 


tatttattcc 


aacatgggtt 


ctttgcatac 


atttatttct 


tcaataatat 


ttattaagaa 


3060 




gtaactaaat 


ccaaaaatta 


ttttagatcc 


tgaacaagag 


agaacaaaat 


ctctactttg 


3120 


10 


atggaacttc 


cattctgtgg 


ggaagagact 


gacaataagc 


aattaaataa 


ataaggtaat 


3180 




ttcctacagt 


gatcaatgcc 


gtaaagcaat 


taagatagga 


ttttgtaaaa 


gacagcaaat 


3240 


15 


aggagtacat 


gttatagatt 


gagggttcaa 


ggtaggctcc 


tctaggagct 


gacatttgag 


3300 


ctacacctga 


acaaaaagac 


actagccatg 


cacagaccat 


gagcccagtt 


aagtgttata 


3360 




gcagcccacg 


agataagaat 


tattattatt 


tcaattttac 


agttgaacct 


gaggcccaga 


3420 


20 


gaatttaaag 


aacttgccca 


acatctcaga 


acaaatggag 


gaatcactat 


tgaaacctag 


3480 




gcaatctgac 


tcaggaggcc 


acagtcttat 


atactgcatt 


agaaagcctt 


agagagcctt 


3540 


25 


ttctttttct 


ttgagaccga 


gtctcactct 


gttacccagg 


ctggaatgca 


gtggcatgat 


3600 


ctcagctcac 


tgcaacctct 


gcctcctggg 


ttcaagcaat 


tctcctgcct 


tagcctcccg 


3660 




agtagctggg 


attacaggtg 


cacaccaaca 


tgcctggcta 


atttttgtat 


ttgtagtaga 


3720 


30 


gatggggttt 


tgccacgtta 


gccagcccgg 


tctcaaactc 


ctgacctcag 


gtgatctgcc 


3780 




catcttggcc 


tcccaaagtg 


cttggattac 


aggcatgagc 


caccgtgccc 


gacctagaga 


3840 


35 


gccttcttga 


tgtgacttgc 


acaaggtggc 


agagttagag 


acagagagag 


gcctggaatc 


3900 


gacccctcct 


gcttctacag 


atagtcctta 


ccatactctg 


caatgttgcc 


tctggcccat 


3960 




cataatgcac 


aaaggcagat 


aagcaaaagg 


acaaggacaa 


gtccattgaa 


aatacatttt 


4020 


40 


tcaatattaa 


agcaaaagaa 


aagcatccag 


gaataagaaa 


caaagaggac 


atgcagtcat 


4080 




atatgcaagg 


tgtcctctac 


aaagataaag 


aatgccccaa 


acccagttgt 


caagatcact 


4140 


45 


ggcagggact 


cctgggccca 


catgctcttc 


ctaaacaacc 


cctccatctc 


ctttctcaga 


4200 


actcagcagt 


aggccttgcc 


tcagatccaa 


ggtcactcgg 


aagaggccat 


gtctaccctc 


4260 




aatgacactc 


atggaggaaa 


tgctgagaga 


agcattcaga 


tgcatgacac 


aaggtaagac 


4320 


50 


tgccaaaaat 


cttgttcttg 


ctctcctcat 


tttgttattt 


gttttatttt 


taggagtttt 


4380 




gagagcaaaa 


tgacaacacc 


cagaaattca 


gtaaatggga 


ctttcccggc 


agagccaatg 


4440 


55 


aaaggcccta 


ttgctatgca 


atctggtcca 


aaaccactct 


tcaggaggat 


gtcttcactg 


4500 


gtgggcccca 


cgcaaagctt 


cttcatgagg 


gaatctaaga 


ctttgggggt 


aagtcagttg 


4560 




ccttccatcc 


catgtcgtag 


ggattctctg 


gctgacagaa 


gctgatgcgg 


tataggccac 


4620 


60 


atacagaatt 


caatccaatt 


tgaagaattg 


ggatccaacc 


tgatgtcttc 


tttatgtcta 


4680 
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acacagtggg ccaaatcagg ggtgcatcag 
atcttatgtc accttttggt tttggggctt 
5 ttgccagaat cccaggcata cctgctccct 
ttcgaacaag aaaaagacaa aattcttggc 
cagctccaag tcaggagcca gagcttccaa 

10 

tcagctcatc cactgctgcc tctgttctct 
ttccacattg ccctgggggg tcttctgatg 
15 gtgactgtgt ggtaccctct ctggggaggc 
tgggaaatgg tgcagacaaa aatgttaaaa 
gtgttgaggg aaatatatga gtaggaaata 

20 

ttgaccaaat tgagtataaa tcccatggtt 
cagagaaagg ggccctgcat ggatctcaca 
25 tcagttctgg tggggagggt ggccctagac 
agcaagaggc agcaggaaca tccacttcca 
gttactatgg tacctggtga aacctgtccc 

30 

cagagcagca ctcttttggc tgttttacgt 
ggtgtggcac ataggacagt gatgtttatt 
35 taaaagataa gtccaggatt aaaattttcc 
ttcctccatt ctacagcctg aatgtgagaa 
cctgtaacag ccaatgtttt catggagtgc 

40 

gcatcagttc actcttcaga gggcatcaga 
cagttccaca agttaaaaac aaagaaaaac 
45 agtattaatt gatgaggaaa tttgagttct 
cagatcccca gatgattgag taaaaggcat 
aggaatatga cagctgaaaa atgaatacaa 

50 

agttgctgaa agtgtaggct tctttctgac 
ctctcctcta tctcctgtct tgcccacccc 
55 agtatattat ttccggatca ctcttggcag 
caagtaacca tatgtccttc tttcccacat 
aaactgagac ccttaaaaag ccaaggtatc 

60 

ctccacaatg ttattgtgat tttatttatg 
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agaagttatc 


acttagatca 


cctctqqqtq 


4740 


gtatatgcag 


gggtccccca 


tcccagtcca 


4800 


ggaaatgccc 


catqtqqttq 


aggaaacaga 


4860 


acctccactg 


cttcctttag 


gcattcctca 


4920 


ccttgtcttt 


gcctgctagc 


agtgatgatt 


4980 


ccccaggctg 


tccagattat 


qaatqqqctc 


5040 


atcccagcag 


qqatctatqc 


acccatctgt 


5100 


attatggtga 


gtaaaagaat 


ageagecatt 


5160 


ggctccacag 


ggatatgcca 


gattatttct 


5220 


ttattgggtt 


aaagtaatta 


aqaaqacaqq 


5280 


gagagtcagt 


ggtcctgttt 


catgtgaatt 


5340 


qqqactqtCC 


aaagcaagaa 


ctctccaaag 


5400 


atttagacta 


gatagcaaga 


tgttttggaa 


5460 


tctacccctt 


cttgcttaca 


attctgtttg 


5520 


atcacaagtc 


agtctcattt 


tgettatega 


5580 


acatgttttc 


caaatctgta 


accctgtctg 


5640 


tccccgtgat 


act tt teat a 


gt tgecacta 


5700 


aaacaagtag 


aqatqqqtta 


gaacaagagt 


5760 


aaaqaaqatq 


agcaatagtc 


atcatcactt 


5820 


ctgtgccatt 


caggtcaagt 


atttccttct 


5880 


gtcatttatg 


tcactgtgaa 


ccccaaaggg 


5940 


tagaaataaa 


acttttaaat 


ttatggtatg 


6000 


gtctctttgg 


tcttactata 


ttcctagtca 


6060 


gaatttagtg 


tcactgagcc 


tqaataaaqq 


6120 


c tgataaaaa 


taaataaata 


attotataaa 


6180 


cagttatcaa 


tgttaaaaag 


tgatctccct 


6240 


ctctccatct 


cccccacctc 


tcttttttac 


6300 


caacggagaa 


aaactctagg 


aaqtqtttqq 


6360 


gtcagagaag 


tacctatttt 


tttcggttaa 


6420 


acagcctctc 


agecctaaaa 


agcaaagacc 


6480 


aaaaacttag 


aagcgaggtc 


tatctgaagt 


6540 
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atgttcatgg gaacagaact aaaagcagat 
cgttaaatgc tgtgtgaaaa taatagacct 

5 

accatttagg aaaattatct gatcaatgtc 
ttttggccgt aacattgaac aagtcagttt 
10 aggaagggta acagtccttg ctaccatgtg 
gtgtgatggt gcttcacagg ctataaagta 
agggactcat ctagatccag aggaaactgg 

15 

aaagagttta gcagcaagtt cgctcccaaa 
gtgtcacttg cataatcaat gagggaatgg 
20 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
accttttatc atccctgcaa gggtagaaca 
ccaaagacat cagccctaat tgtattttgg 

25 

atgtcagcag gaatagatgt tgttactgtt 
ttgatagtac agatctagag ggttctatct 
30 atcccactaa agtgtgatac aagaaacttc 
ggcagataat aaactggata atatttatct 
ctactaggat ggtggcagtg gcagtgaagg 

35 

ggttactctt agaatttttg ctttataaaa 
tgagaaaaga acaggataga gacatgattt 
40 actgagagaa cttcagtata tttagttgta 
ggaattccct cccagattat gttttccaaa 
tctccatttc aggtcaaagg aaaaatgata 

45 

tctggaatga ttctttcaat catggacata 
atggagagtc tgaattttat tagagctcac 
50 ccagctaatc cctctgagaa aaactcccca 
ctgttcttgg taagtgttct tggtaagtgt 
tgacttgttt attatgagca tgaactctgg 

55 

tctgtgtggc tttgggaaag aattttaacc 
atggggatgg taacactatt tatctaatag 
60 atacaaagca tttagaacag tgtttggtac 
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ccatgaaaac 


catacctaca 


gtcttaagaa 


6600 


ttctgaaagc 


cctatcattt 


ctcccagatc 


6660 


atgattgatt 


caaattctag 


ctaagccatt 


6720 


acctctatgt 


tcctgagttt 


tcacctagaa 


6780 


acgtccaatg 


gagatgaaag 


gcagtagagt 


6840 


ctacactgtg 


gtcttgccca 


taaaacccct 


6900 


cctgcagagc 


tgctgatgct 


gtatggatga 


6960 


aaattcctcc 


cccaacactg 


ttactaaact 


7020 


gtggattgan 


nnnnnnnnnn 


nnnnnnnnnn 


7080 


nnnnnnnnnn 


nnttatattt 


tcctgtttca 


7140 


ttcacactga 


tattttctta 


cctatgctac 


7200 


aagatagctg 


actggggctg 


attgcaacct 


7260 


gttgcttctg 


ctttttttat 


ttccatttat 


7320 


gaaccttccc 


aacctatact 


tcataatacc 


7380 


ttcactctct 


tccctctacc 


tatttatgaa 


7440 


tcacttattc 


aacaaacatt 


tattgagtgc 


7500 


aaatgcaagg 


atacaagata 


tagaatcaag 


7560 


cagatggatg 


gtgaatgaga 


tagggaagac 


7620 


tattttatag 


tgacaaagag 


gctaaaaaca 


7680 


gttgctttgt 


gagtcagggc 


agttgcattt 


7740 


gggaaatcaa 


acccaattaa 


taaatctgtg 


7800 


atgaattcat 


tgagcctctt 


tgctgccatt 


7860 


cttaatatta 


aaatttccca 


ttttttaaaa 


7920 


acaccatata 


ttaacatata 


caactgtgaa 


7980 


tctacccaat 


actgttacag 


catacaatct 


8040 


gagattggat 


ttctctccag 


ggaggaagga 


8100 


atccagacca 


cctgagtttg 


ctagttactg 


8160 


acactgtgcc 


tcaatttctt 


caactgtaaa 


8220 


ggttgttctg 


aggaataaat 


aagttaatat 


8280 


acagaagtgc 


tatataagca 


ttggctttaa 


8340 
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cattaattat tctcaacatc attaatggca 
agcatagaag aaattcctaa gatgttatga 
5 gagtttgaaa gagataaaaa aaaatgcatc 
atgccacacc aactaaggct gaatacaatt 
atacagaatt gattcttcaa ctgattattc 

10 

aaccatctgt tgtgtgccaa aagttgtgtt 
tgtactagca gttctcacag ctattcatta 

15 caattatagt caacaataac ttactgaaca 
tttcaccttc attcttctgt tgtttttcag 
ttcttccagg aacttgtaat agctggcatc 
agacccaaat ctgtaagtag tagcccctct 
ccacaaagga tcatttatgg agaatgtaat 

25 ttaaaaaaaa aaattggtct tggcatattt 
tttgaaataa gtctgtagtt agagcctaca 
tttttttgag acagagtctc actttgttgc 

30 

cccagtgcaa cctctgcctc cgggttcagg 
tgggattaca ggcacccgcc atcatgcctg 
35 gttcaccatg ttggccaggc cagtcttgaa 
gcctccgaaa gtgctgggat tgcaggcgtg 
ttattaacct tccctgctgc ttgacttcag 

40 

cagtgtgtaa aagcaagatg gttgaaggtt 
tttttcagta ttcccctctt ccctcctttt 
45 aaagggctcc acaaatggga aataaaaaca 
cagaaccaga ctgttagatt tagaatttag 
taaagaaaaa tagtatcaaa gagagaaaat 

50 

aaatcagaac gaaaatctaa actcctctat 
ggtcattcca ggcactgtgg tcaatgtctg 
55 tttaataaat gacttgataa ggatataagc 
atataataaa tgacataagt agcataaaaa 
attgttgaca aaggtgtcag tggtaaaagt 

60 

gaacatagtt ctcctgtcag cagaagaaaa 
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ttaatattta 


ccattatatt 


aactaagatg 


8400 


atatctttta 


cccaaaaact 


tttaaatqta 


8460 


tcccaaggat 


aaacraacitct 


cctaggcatg 


8520 


taatttgaga 


ctcattactt 


attctttcct 


8580 


ataccttact 


ttctatcagc 


aatacacatt 


8640 


aaQacit taQQ 


gt tataaaga 


tgctgtctcc 


8700 


cttgtctaaa 


gaattgatct 


ettaategtt 


8760 


ccaacaatgt 


tctt tgtgcc 


attattacat 


8820 


ggcattttgt 


cagtgatget 


ga tetttgee 


8880 


gttgagaatg 


aatggaaaag 


aacgtgctcc 


8940 


ggccaaaacc 


tccc tctaga 


aaatccacat 


9000 


c tgatgagt t 


gttgaaact a 


ggagcttgat 


9060 


ctaaaaaaca 


aataatataa 


tatcttaaaa 


9120 


gttctatatt 


ctgtgcgt c t 


tttttttttt 


9180 


ccaaactaaa 


atacaatoac 


acaatcttoa 


9240 


caattctcct 


acctcaccct 


cccaaataac 


9300 


accaattttt 


gtatttttgt 


aaaaacaaaa 


9360 


ctcccaacct 

W \r Vp- W ^-J u V#< V* w 


caggtgatcc 


tcccacctca 


9420 


age caeca tig 


ccccacc tc t 


gtgeatet tc 


94 8 0 


acttccatcc 


W Oi ^» »■» V» w a *- y 


attccatata 


9540 


qaaaqctaaq 


tcactagttc 


tttggtttca 


9600 


cacctaattc 


ctcttccaaa 


acaacaggtt 


9660 


gaccaataat 


agatatatta 


etaactgett 


9720 


aaatcatege 


taattact tc 


attttgeatt 


9780 


tattctccca 


gagttatatt 


cactaaaqqc 


9840 


tactgeaatg 


ttcttttccc 


aataccacgt 


9900 


ctgcccttga 


agatttat tc 


aaacttaaat 


9960 


acctgeaaaa 


aaattttggc 


att taaaggc 


10020 


ccaggaggta 


tttgataaat 


gtttgtggag 


10080 


aaagaatggt 


ttgtttaatt 


ttctgtttta 


10140 


aaaagaacag 


actattgaaa 


taaaagaaga 


10200 
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agtggttggg ctaactgaaa catcttccca 
tccaatccaa gaagaggaag aagaagaaac 

5 

tcaggaatcc tcaccaatag aaaatgacag 
tttccttttt taaacattag tgttcatagc 
10 gaggtactct gcacatacgc accacatctc 
tccttctctc ttacattgaa tgtagagaat 
cttcttcttt tgagcaactt tcttacactg 

15 

gatttcctac taacctgttc cttggatagg 
ttctccatca acaaccaggg agactgcacc 
20 cattcctaaa ctatcttttt tttattccac 
tcattgtttt aaggatgata aaaaaaaata 
tttatctttc tacagggctg acattgtggc 

25 

gaagctctaa atagccaaca cccatctgtt 
acatgtctct gccttaactt ttcctaactc 
30 tacttttagc cattatgcga gaaaagaaaa 
gcccaaattt caaataataa ttaacattta 
atttcatcaa ataacttcat ttgatgttcc 

35 

tacaagtaat tcaaagaggc caaataactt 
cataactagt aattatgaga gccttgtttc 
40 atgtttgaaa taaccaaatt gtctctccaa 
cagcttttat caagaattta ctctctgacc 
agccagattc agattttacc catggggata 

45 

aactacttgc atcataaata tataataact 
tatagacact ctcagaatat catacttgga 
50 caacagctgc cctacttttc aattcagata 
cctagggtga agtcactata atctgtagtc 
gctgccagat tgtggcaggt aaagagacaa 

55 

atttttagcg accactagtg gaagacattc 
tttctgtcta tgtaaagttc tcaaaatttg 
60 ttttctgcaa cagatgattc caacatgggt 



accaaagaat 


gaagaagaca 


ttgaaattat 


10260 


agagacgaac 


tttccagaac 


ctccccaaga 


10320 


ctctccttaa 


gtgatttctt 


ctgttttctg 


10380 


ttccaagaga 


catgctgact 


ttcatttctt 


10440 


tatctggcct 


ttgcatggag 


tgaccatagc 


10500 


gtagccattg 


tagcagcttg 


tgttgtcacg 


10560 


aagaaaggca 


gaatgagtgc 


ttcagaatgt 


10620 


ctttttagta 


tagtattttt 


ttttgtcatt 


10680 


tgatggaaaa 


gatatatgac 


tgcttcatga 


10740 


atctacgttt 


ttggtggagt 


cccttttgca 


10800 


acaactaggg 


acaatacaga 


acccattcca 


10860 


acattcttag 


agttaccaca 


ccccatgagg 


10920 


ttttgtaaaa 


acagcatagc 


ttatacatgg 


10980 


ccactctagg 


ctattgtttg 


catgtctacc 


11040 


aaatgaccat 


agaaaatgcc 


accatgaggt 


11100 


gttatattta 


taatttccag 


atgacaaagt 


11160 


atgatcaaga 


aagaatccct 


atctctattt 


11220 


gtaaacaaga 


aaaggtaact 


tgtcaacagt 


11280 


ataaccaggt 


cttcttactc 


aaatcctgtg 


11340 


tgtctgcata 


aactgtgaga 


gccaagtcaa 


11400 


agcaataaac 


aagcactgag 


agacacagag 


11460 


aaaagactca 


gactttcacc 


acatttggaa 


11520 


ggtagtttat 


atgaagcaga 


cactaagtgc 


11580 


aacaatgtaa 


ttaaaatgcc 


gaatctgagt 


11640 


tactagtacc 


ttacctagaa 


ataatgttaa 


11700 


tattatttgg 


gcatttgcta 


catgatgagt 


11760 


tgtaatttgc 


actccctatg 


atatttctac 


11820 


cccaaaatta 


gaaaaaaagg 


agatagaaga 


11880 


ttctaaatta 


ataaaactat 


ctttgtgttc 


11940 


gtttgtctat 


tcttctttac 


tcttgaaaca 


12000 
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ttagaccatg 


ggaggctctt 


acagccttga 


gttgatattt 


atacaaccca 


aatctaggtt 


12060 


tgaacggtga 


ggtgtcaggt 


catcaaatat 


tcatgtctat 


atagtcttac 


acaggttctc 


12120 


aaaaaaaatg 


ttcatgggat 


aggtcattga 


taatggattc 


cttattctga 


gaactccaga 


12180 


cgactgaaat 


atatgagaga 


aggaaaagga 


catagtagga 


gcaggcctga 


gaaaaaaatg 


12240 


aaagtcagaa 


atctttaaaa 


aaatacaaga 


tcttatttct 


atcttatttt 


ttctcctctt 


12300 


ctgaaatata 


tatgaggatt 


cctctccaaa 


cccatggttt 


ctctaagaat 


tttgagtcat 


12360 


ttgtatgacc 


tcaaataatt 


agttttagct 


gacctcacat 


aactccttat 


aataggagac 


12420 


atctttaatg 


tctgctatta 


aagaaggatg 


aaaattccta 


tgaccttctc 


cccgattatc 


12480 


cctttggcta 


tatagagtca 


aataataaca 


ttgaccaata 


gtaaacatgc 


tttgccaaga 


12540 


agtagaagat 


atattctcta 


gccttagttt 


ttcctcccaa 


tttgcatttt 


tgtaaaaata 


12600 


atgttgtatc 


cacaaaggaa 


ataaacttta 


aaaacccaag 


tgcactcatt 


gactaggagg 


12660 


ctgacgtaac 


ttattttcat 


tttcttctgg 


ttttcttata 


atctttgttt 


aaaataaggt 


12720 


gataaaccat 






cicraattact t 

y y y o i_ v— 


afct fcaaaaaa 


ttttaatcct 


12780 


ggacttatct 


tttatagtgg 


caactatgaa 


aagtatcacg 


gggaaataaa 


catcactgtc 


12840 


ctatgtgcca 


cacccagaca 


gggttacaga 


tttggaaaac 


atgtacgtaa 


aacagccaaa 


12900 


gagtgctgct 


ctgtcgataa 


gcaaatgaga 


ctgacttgtg 


atgggacagt 


tc 


12952 



<210> 74 

<211> 10100 

<212> DNA 

<213> homo sapiens 



<220> 

<221> genomic_DNA 

<222> (1) . . (10100) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding_region 

<222> (1918) . . (1973) 
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<220> 

< 2 2 1 > coding_r egion 
<222> (2844) . . (2973) 

<220> 

<221> coding_region 

<222> (3488) . . (3622) 

<220> 

< 2 2 1 > coding_region 
<222> (5944) . . (6000) 



25 <220> 

<221> coding_region 



<222> (6552) . . (6710) 



<220> 

35 <221> coding_region 
<222> (7116) . . (7214) 

40 

<220> 

<221> coding_region 
45 <222> (8710) .. (8805) 

<400> 74 

50 ggaaaccccc cccaaaaaaa aaaaaaaaaa aaaaaagaaa agaaaagaaa aagaaagata 60 

aaatgaaaat tcgtggctgg gcgcagtggc tcactcctgt aatctcagca ctttgggagg 120 

ccgaggtgag tgagtcacct gaggacagga gttcaagacc agcctggcca acatggtgaa 180 

55 

accccttctc tactaaaaat tcaaaaaaat tagcccagtg ttgctgcaca ctcctgtagt 240 

cccagctact cgggaggctg aggcaggaga attgcttgaa tctgggaggc ggaggttgca 300 

60 gtgagccatt gccctccagc ctgggtgaca gagtgagact ctgtctcaaa aaaaaaaaga 360 
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aaaaaaaaga aaatagaaaa ttatcgaggc 
tgttaccttg tctgggaatt ccttccctga 
5 ttttaccctt ctatgtgttc caacatcctc 
catttatctt tgttctattc tgcttgattg 
catctgtgca tgcacagtga taagggcagt 

10 

atgaacgtat aagtgagaaa gttctattcc 
ccaattcctg aagacaccat gggaagatga 
15 ctataccaat "atgtctttta gtatactctg 
agattttctt tttcaccccc cacccccacc 
ttttagtatc ttaaatatta ttgaaagcca 

20 

taaaagagat taaaaccttc tctgaatttt 
gttatggtat ttcaatgatc acatggttta 
25 atccaatgct gtttagattt gctcatggga 
ctcactatcc cattcttgcc actgtaaaga 
gacaatggag tcttcttagc ttttacacac 

30 

cagcctgttg tatttccaag accatctgct 
acggaaagag agagaaagat gatagatatt 
35 actaacacat actcactcac atacaacagc 
agcatgcctt aaacaaggaa gacaagcttt 
agaaatggct gaattgggac acaagtgggg 

40 

ttttctgcag ttctttctca ccttctcaac 
gtaaatcctg ctagtctcag gcaaaattat 
45 tattagtctt tatttagtag acttcttaat 
gatcttaatc agccaagaaa cttatctgtc 
tcaatgtcat gaggtaaccc atttcaactg 

50 

. caccaagtct caatataata atattcttta 
gacacagaaa gtaataggag agcaaatctt 
55 caaggtatta tttttttcta ccctcagtca 
gcttaggaga tgaaacttta ttgatttagg 
gggtatgagg aagctacttg ctgtactttc 

60 

tttttactca gcttgcaaac tggaaataga 
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tatcatttta attttaagtc 


tgctctcaaa 


420 


actctacata tcccctatca 


ggcagaattg 


480 


tggcctccaa catattgggt 


aataatatat 


540 


tggactagaa ggacctggga 


ttttggcatt 


600 


gcctggcatg gaacactaag ttctttttgg 


660 


ttaattccaa tagcaccttg 


tttaggtcca 


720 


cacctgagtt aataaactta gatacaaaag 


780 


tttatgcaaa tatcttagtt 


tcctgcaaaa 


840 


aaaaaaacca cacatgatta gaggtgtgat 


900 


ttcaaggtta tgtttataat 


taaactgaaa 


960 


atctgtactg attttgctat 


ggtctagcta 


1020 


gccctaattc taatttaggc 


atactactca 


1080 


tttgttataa agttcatgtc 


tctcagtgtt 


1140 


tctaagcacc atcagtctaa 


taatttagca 


1200 


tatcaggtat ggttcacagc 


agttagaatg 


1260 


gcagcgaatc atgtaaacag tttccaaaag 


1320 


aaagtcagag atggatctgg 


agatttagag 


1380 


acttcagttt tgggtctatg 


tgtgaaacca 


1440 


tcaaaggtgc aattggataa 


cttctgccat 


1500 


acaattccag aagaagggca 


catctctttc 


1560 


tcctactaaa atgtctcatt 


ttcaggttct 


1620 


gctccaggag tctcaaattt 


tcttatttca 


1680 


ttttctattc atcacaagta 


aaagcctgtt 


1740 


tggcaaatga cttatgtata 


aagagaatca 


1800 


cctattcaga gcatgcagta 


agaggaaatc 


1860 


ttcctggaca gctcggttaa tgaaaaaatg 


1920 


gctctcccac aggagccttc 


caggtaggta 


1980 


cttagtggca ggggaagtca tagtcacggt 


? 04 0 

£. U *± KJ 


catggatcca tctagtttaa 


ttaatatatt 


2100 


catgtggttc tctctccctg 


gagaggaaca 


2160 


ttttctcaca ttagaagctc 


attttctggg 


2220 
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tatgagacag gagagttcat actgtgtatg 
ctgagggaca catatccttc acacatgctt 

5 

ttgattggtc tttataatcc ataagctgtg 
agagtcccat ctagctattc aaagccattc 
10 ggtttgttac tgacaaaata tattcagata 
ataggtaaat gtgtttattt tcagttttac 
gaaagtaaca ctggattaag aataggaaaa 

15 

atctaatcac ttggatagtg ttcaggtgtt 
tagtttcttg gtgattacaa tgaagatttg 
20 aaactcccct ttctgtctgt cgagaatgtt 
tgaattgctt ttaaatttca cagtgtgcct 
gaagtatctt caggcagact attgaagtcg 

25 

acagttttga aaaaagagca ggagttcctg 
agagtaaggg ttgggtctag aaaagaatat 
30 ttcatgagag gtgttaggtc ctttaaaaaa 
gtccttttgg gaagagagaa gcatttcctc 
ggccaacagg agtgaaggat tcataatctt 

35 

tgatgttcaa catgactacg tacataaagg 
attaggtagt gcttgtatca attcttatag 
40 tggatcattt ctcaggacag tctaggacac 
attaaaaaat gatacaatct cgggaaaatt 
tctataggta acacaaattc tgactgctat 

45 

ctctgtactt gatatttcac acattgaggg 
tccattctgg ggagccatat ttgtgagtat 
50 aacataggtt tttctctttc tcagatctaa 
atttataatt cttaattata aattatatgt 
caacaacaaa agattctttt tacaattaac 

55 

tcccatgaga aacctgaatc taatacaagt 
atagtaattg aacctaaact ttcagaatcc 
60 atcaataaag ttatatgata aatacataca 
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tagatctctg gcttctgggt ctgacatgtg 


2280 


ttataaatac 


ttgataaagt aacctgcttc 


2340 


ggatgcttct 


ctgaagatga aaatagtaat 


2400 


cttcattgta 


ttctgtgcac atgaagttgg 


2460 


catttctatg 


ttaaaaggat tgtgagatgc 


2520 


ttgtcaacat agatgaatga gaaagaactt 


2580 


tttggcatgg 


attttgctcc attttgtccc 


2640 


cttggtcagt 


tacttggatg ctctgagctt 


2700 


aattacagga 


tggctttgaa aaaataaaca 


2760 


gcacagggag 


ttacagaatg ttctcatgac 


2820 


gcatttgaag tcttggaaat atctccccag 


2880 


gcctcatccc 


caccactgca tacatggctg 


2940 


ggggtgagtg 


agcctcctcc aactttgact 


3000 


tgagttgcat 


caactgtttt cccacttgga 


3060 


catggtagat 


aaagagttga cactaactgg 


3120 


ataaagactt 


taaattgcta ggacgagaat 


3180 


tatctttact 


tagatgtaaa gaacaattac 


3240 


cgcatggaga 


aaagtattgg ccttccatgc 


3300 


tggctagggt 


atcctggaaa atcttacgtg 


3360 


taacgcagtt 


tctcatgttt ggcttctatt 


3420 


tttttgattt 


tcatgaaatt catgtgtttt 


3480 


gatatgcctt 


tgttttggaa cagttgtctg 


3540 


agacattttt 


tcatcattta aagcaggtta 


3600 


atatctataa 


ttgtttctga aataacactg 


3660 


ccagttgttt 


attcccagta ttaacatgat 


3720 


gagcatatat 


aacatagata tgctcattaa 


3780 


ggtgggttaa 


acatttagcc cacagtttta 


3840 


taaatgactt gcctaagggc cacttgacta 


3900 


aactccagga 


acatacttct agcactattc 


3960 


actttatctg 


tcaactaaaa ataacaacaa 


4020 
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aggctgggca tggtggctca cacccgtaat 
atcacctgag gtcaggagtt tgagaccagc 
5 taaatataaa aaattagctg agtgtgatag 
gctgaggcag gaggcttgtt tgaacctgga 
ccattgcact ccagcctggg caataagagc 

10 

aatagaaaat aaagttgtct tcatgaaaaa 
ttaagatcaa agggcatatg gtgaccttct 
15 cctggtatct cttttactca tcgttctatc 
aggtatataa ctttttaaag gtctaattga 
acaatttgct aaccttgaca cacatagaca 

20 

gtatgtatcc atcatttcca aaagcttccc 
tccactcctt gtcctctcgt tcccaagaaa 
25 aacttacatt ttttagagct ttatataagt 
tgcacagtta ttttgagatt cttcaagttt 
atatatttta attctagact atgtcacatt 

30 

cagatgactt gttcaggact ctccttgaac 
ctactgtctt tttatttcaa acacagctta 
35 atctgagtct ttaggagatt gctttgaatt 
tgtaaatgtt tggtaaaaat atcatcatgt 
atatatgtca gactctggaa atatgcatgc 

40 

tgatggaaga cgcacataca aatagacgat 
attcatgccc tgatgttgct ggacgttgct 
45 ttagcatctc ctggaagcct catgtaaatg 
gaataagaat ctgcatttta gcaagaccct 
tttttagatg gagtctcact cttgtcgccc 

50 

cactgcaacc tctgcctccc gggttcaagt 
gggactacag gtgcatgcca cagtggctgg 
55 tttcaccatt ttggtcaggc tggtcttgaa 
gcttcccaaa gtgctgggat tacagacatg 
catttaattc tgagaagcac tctatagaaa 

60 

tgacattaat aagtaacttt atcgagtacc 
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cccagcactt tgggaggctg aggcaggtgg 4 080 

ctgaccaaca tggtgaaacc tcatctctac 4140 

tgcatgcctg taatcccagc tacttaagag 4200 

aggcagaggt tgcagtgagc tgagattgtg 4260 

gaaactctgt ctcaaaataa taataataat 4320 

tgaggaaaga gattgctggg gtgagaaaca 4380 

atgccctaga aactctttta ggtattttct 4440 

tggaaaaata ggtggatgag tgagataata 4500 

catataataa attgcaagta tttcagatgt 4560 

cacatgaaaa catcaccaca ttaatacaat 4620 

tgtgtatctt tgtaactctt tcttcctccc 4680 

acattgatct gctttctgtg aatataaatt 4740 

atgttctctt tactgtttgt cttccttcgc 4 800 

tttctttata tcgatacttc attcacaaga 4860 

gactttgtag tctgctaaat ccttagtgct 4 920 

ctgtacctct gttataattg aaacttgtct 4980 

tgaggtgtct ctcaacccat caaactcaca 5040 

tgtgctattg acttatattt atatcaaata 5100 

acattttcat aattactcta tattcacatg 5160 

cacagacacg tgtttcttgc ctaaaggggc 5220 

tgcaatagaa tgagagtggt ggtctaatcg 5280 

actccaagag taacccctgc attgtcaggg 5340 

aagaatttca tgctccatcc aggacctaat 5400 

catatgattc atatacactt tttttttttt 5460 

aggctggagt gcaatggcat gatcttggct 5520 

gattctcctg tctcagcctc cctagtagct 5580 

ctaatttttg tatttttagt agagacaggg 5640 

ctcatgacct ccggtgattc ccccgcctcg 5700 

agccaccaca cccagcctta ttcgtataca 5760 

ataagaataa gaaaatattg ggctcacagg 5 820 

ccaaatgtta cctatgtttg gaagatgggg 5880 
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ttaaaaagga 


cacattgaaa 


acaagaaact 


5 


cagttttcta 


tttctggaat 


gttgtcaatt 


gtgagttgcc 


cgtttctgtc 


tttgtccatc 




gagtcttaag. 


ggaaacacat 


ctttgtctcc 


10 


ttgtttcaat 


ctattttgtg 


tcctaaggct 




gaaacataaa 


ttgtaccatt 


taacatacaa 


15 


taaaaaaagt 


gtcctcacac 


cggcattgtc 




taattagttt 


tcaattaagg 


agaatatttg 




tgaaggcagt 


ccagtgagaa 


agtaatatat 


20 


tttccctttt 


taaccaaaat 


ttggaagcaa 




aaatgaagat 


aggtttattg 


aatgtgccag 


25 


attattatca 


ggtgagagga 


agcctgggag 




cgggaattac 


catcctgatc 


atcaacctga 




gttgccagaa 


attttttgag 


accaagtgct 


30 


ttttttgtgt 


gggaagacta 


agattctggg 




tgttccatga 


acaccatcct 


tttctgtaac 


35 


ttcacacagc 


ccagggagat 


gctggctgcc 




ggttaaagtt 


atctactgca 


agtgacgatc 




tgtgtgtgtg 


tgtgtgtgtg 


tgtgtgtgta 


40 


tagggagatg 


aaaacaggag 


atgctataag 




actctttttg 


tttgtcattt 


gttgctgttc 


45 


tcaccattct 


gggacttggt 


agtgctgtgt 




tcaaaggaaa 


caaggtagat 


agaagcccga 




tggagcttta 


ttccttaaaa 


atatggcctg 


50 


gtttctccaa 


gttttattac 


tttggtttac 




ccataagtag 


cacacatttc 


ataatatatc 


55 


tgtataatgt 


tagaaaatgc 


cttaaagtgt 




cttgtgagga 


aaccacccag 


gtatctggcc 




tcctgtttct 


gttttattcc 


tttgtttctt 


60 


tcagttttct 


ttcctttctg 


attcatgact 
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cattgtggct tttttttcct cctttttgaa 5940 

atatctgaaa ggagaaatgc aacatatctg 6000 

cttgaaaaga taagaagaac agagttttaa 6060 

tatattactt gtgaatgtgg atatatgatt 6120 

ttttgcaaca gaagttggat ataatcatta 6180 

tgaagtttat gtttaccttg acgtttcttc 6240 

cttgtaggca tattcacatg atcaaataaa 6300 

aggaaagacc gtacgtgttc atgtggttcc 6360 

gctcattaaa caatgcggac attttcaggg 642 0 

tgtggaattt actggatgca tccagccctg 64 8 0 

caagtgcagg cccaggtctg agtgttcttc 654 0 

caaacactgc cagcagcata gctgggggaa 6600 

agaagagctt ggcctatatc cacatccaca 6660 

ttatggcttc cttttccact gtatgtattt 6720 

tcctaatgta agtaagaagc cctcttctcc 6780 

ttctattaca cagtatagtg gttctgtaag 6840 

cactcccctc aacccaggca aattcctcgg 6900 

tctgggtttt tctgtgcctg tgtttgtgtg 6960 

tgtgtcactt taaaaggact ggtcagatgg 702 0 

aaaataaact tttggggcga ataccatgtg 7080 

aataggaaat tgtagtgatg atgctgtttc 7140 

cactcacaat ctgtggagct ggggaagaac 7200 

tataaaatct tgaatgacag gttaacgaat 7260 

ggttttctga aacatttctt ccagaaaata 7320 

aaatctcaca ttttaaatca cattttatac 7380 

cctctgaatg agggttggga taataggatc 744 0 

gtggagcatg agagatggat gtgcagaagg 7500 

ttgttttctg ccccagaagt agccgcctat 7560 

gacttttcct ttccaacttg ctctaaaacc 7620 

accaaatgtt tccacttgcc tcacccgtcc 7680 
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attacacctt 


tgataagaac 


caccagcacc 




gaagaaacat 


actctctcca 


tctgtccact 


5 


taaaatcact 


ctcctccagg 


ctgggcacgg 




gaggctgagg 


agggcggatc 


acttgaagtc 


10 


aaaaccaaat 


cttcaattaa 


aaccaaatct 




aaacaaaaca 


acaacaacaa 


aaacagaaaa 




cctgaggttc 


cagatacttg 


ggaggctgaa 


15 


aggttgcagt 


gagccgagat 


catgccactg 




cccagcacat 


tgggaggcca 


aagctgaaga 


20 


ctggccaaca 


tggtgaaact 


ccgtctgtac 




gcacacacct 


gtaatttcag 


ctacttagga 




gaggcggaag 


ttgcagtgag 


ccaagatcgt 


25 


tgagattctg 


tctcaaaaaa 


aataaaagaa 




ataaataaga 


cagcagatat 


actaaggaat 


30 


atttcttttg 


ggcccacttg 


gccagctagt 


aatcagagtt 


taatgacaga 


gagcgtgaga 




ctcttgagcg 


agacttctag 


ggatgggaaa 


35 


ccttatcagg 


ttccagagga 


tcgtgtttat 




agtgagttgg 


aagacccagg 


ggaaatgtct 


40 


cagaacactc 


tgattcacag 


ccaaggatcc 


ggaaaaattt 


ctattctctc 


cacagcctgc 




agaatatttt 


gtttctgctg 


cttctgtcca 


45 


gatagactcc 


tatttttctt 


gttttatatt 




acatgtagta 


agcaagattt 


aactgtttga 


50 


cggtggttga 


aacttgagtt 


aaacctttga 


tttttttttt 


ttagcattct 


taatagttac 




acccacacag 


tcacagtcac 


acacacatat 


55 


^LaciaLaLL i_ 




^ctctt-ctd l_clCcL 




tctgtataat 


tgcctgaatc 


accagcacat 


60 


aagtaggaag 


tggggaattt 


tattaagtta 


aacagggaaa 


ttataagtgc 


agagattaac 
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ttgtgctcat 


gtacttgccc 


atgtctgatg 


7740 


ttcctgaggc 


attcaagtct 


agccaccttt 


7800 


tggctcacgc 


ctgtaatctc 


agcactttgt 


7860 


aggagttcaa 


aaccagcctg 


gccaaatggc 


7920 


taaaccaaat 


ctctactaaa 


aaatacaaca 


7980 


ggaaacatta 


gcccagcgtg 


gtggcaggta 


8040 


gcaggagaat 


cgcttgagcc 


caagagatgg 


8100 


caccacagcc 


agggtgacag 


agccatactt 


8160 


ataatttgag 


gtgaggattt 


ggagaccagc 


8220 


taaaaatata 


aaacttagtg 


gggcatgggg 


8280 


ggctgaggca 


ggagaattgc 


ttgaacccgg 


8340 


ggccactgca 


ctccagcctg 


ggtgacatag 


8400 


atttaaaaaa 


tcactctctt 


ccaaagatag 


8460 


aacctcacca 


acttgtcatt 


gactgacatg 


8520 


ctggtttggt 


tttctggaaa 


tgaaagaaat 


8580 


cccagaaaga 


caaaagtaga 


tgaggtaagt 


8640 


tttgtggtga 


ttgatatgaa 


atgatttttc 


8700 


gaagaattaa 


acatatattc 


agctacttac 


8760 


cctcccattg 


atttataaga 


atcacgtgtc 


8820 


agaaggccaa 


ggtcttgtta 


aggggctact 


8880 


tggttttaca 


ttagatttat 


tcgcctgata 


8940 


ccttaatatt 


ctccttctat 


ttgtagatat 


9000 


atgaccacac 


acatctctgc 


tggaaagtca 


9060 


ttataactgt 


gcaaatacag 


aaaaaaccca 


9120 


ccgcttgaaa 


aaatttggtt 


tttagggttt 


9180 


agttgggcat 


gatttgtacc 


atccacccat 


9240 


gtattactta 


cactatatat 


aacttcctat 


9300 


tttttarcaa 

W Kr b» k> V» ^ CL 


y c* k_ d uy day i_ 


ft*t"at"aa ana 
\- \- i_d uddctya 


7JDU 


tcactgacat 


gatattattt 


gcagattgac 


9420 


ctcgttgtct 


ggggaggtaa 


ataggttaaa 


9480 


atttcacaaa 


tgtttagtga 


aacatttgtg 


9540 
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aaaaaagaag 


actaaattaa 


gacctgagct gaaataaagt gagtggaaat ggaaataatg 


9600 


gttatatcta 


aaacatgtag 


aaaaagagta 


actggtagat 


tttgttaaca 




9660 


taaagttaga 


caagcaactg 


gttgactaat 


acattaagcg 


tttgagtcta 


agatgaaagg 


9720 


agaacactgg 


ttatgttgat 


agaatgataa 


aaagggtcgg gcgcggaggc 


tcacgcctgt 


9780 


aatcccagcc 


ctttgggagg 


ccgaggtggg 


cagatcacga 


agtcagtagt 


ttgagaccag 


9840 


cctggccaac 


atagtgaaac 


cccgtctcta 


ctaaaaatac 


aaaaaaaaaa 


ttagctgggt 


9900 


gtggtggcag 


tcacctgtag 


tcccagctac 


ttgggaggct 


gaggcaggag 


aatcgcttca 


9960 


acctgggagg 


cggaggttgc 


agtgagccga gatcgcacca 


gtgcactcca 


gccttggtga 


10020 


caatgggaga 


ctccatctca 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaataaaa 


atttacaaat 


10080 


tttaaaatgg 


ggggaaaatt 










10100 



<210> 75 

<211> 22884 

<212> DNA 

<213> homo sapiens 

<220> 

<221> genomic_DNA 

<222> (1) . . (22884) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding_region 

<222> (12238) . . (12393) 

<220> 

<221> coding_region 

<222> (13544) . . (13681) 

<220> 

<2 21> coding_region 
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<222> (15338) . . (15394) 
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<220> 

<221> coding_region 

<222> (18027) . . (18188) 

<220> 

<221> coding_region 

<222> (19594) . . (19695) 

<220> 

<221> coding_region 

<222> (20223) . . (20249) 



<400> 75 
tttttgttct 


gccatagttt 


actgagaatg 


atgtttccaa 


tttcatccat gtccctacaa 


60 


aggatatgaa 


ctcatcattt 


tttatggctg 


catagtattc 


catggtgtat atgtgccaca 


120 


ttttcttaat 


ccagtctatc 


attgttggac 


atttgggttg 


gttccaagtc tttgctattg 


180 


tgaatagtgc 


cgcaataaac 


atacgtgtgc 


atgtgtcttt 


atagcagcat gatttatact 


240 


catttgggta 


tatacccagt 


aatgggatgg 


ctgggtcaaa 


tggtatttct agttctagat 


300 


ccctgaggaa 


tcgccacact 


gacttccaca 


atggttgaac 


tagtttacag tcccaccaac 


360 


agtgtaaaag 


tgttcctatt 


tctccgcatc 


ctctccagca 


cctgttgttt cctgactttt 


420 


taatgattgc 


cattctaact 


ggtgtgagat 


gatatctcat 


agtggttttg atttgcattt 


480 


ctctgatggc 


cagtgatgat 


gagcatttct 


tcatgtgttt 


tttggctgca taaatgtctt 


540 


cttttgagaa 


gtgtctgttc 


atgtccttcg 


cccacttttt 


gatggggttg tttgtttttt 


600 


tcttgtaaat 


ttgtttgagt 


tcattgtaga 


ttctggatat 


tagccctttg tcagatgagt 


660 


aggttgtgaa 


aattttctcc 


cattttgtag 


gttgcctgtt 


cactctgacg gtagtttctt 


720 


ttgctgtgca 


gaagctcttt 


agtttaatga 


gatcccattt 


gtcaattttg gcttttgttg 


780 


ccattgcttt 


tggtgtttta 


gtcatgaagt 


ccttgcccat 


gcctatgtcc tgaatggtaa 


840 


tgcctaggtt 


ttcttctagg 


gtttttatgg 


ttttaggtct 


aacgtttaag tctttaatcc 


900 


atcttgaatt 


aatttttgta 


taaggtgtaa 


ggaagggatc 


cagtttcagc tttctacata 


960 


tggctagcca 


ctttccccag 


caccatttat 


taaataggga 


atcctttccc cattgcttgt 


1020 
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ttttctcagg tttgtcaaag atcagatagt 
ctgttctgtt ccattgatct atatctctgt 

5 

ctgtagcctt gtagtatagt ttgaagtcag 
ggcttaggat tgacttggtg atgcaagctc 
10 ttttttccaa ttctgtgaag aaagtcattg 
aaattacctt gggcagtatg gccattttca 
gaatgttctt ccatttgttt gtatcctctt 

15 

ccttgaagag gtccttcacg tcccttgtaa 
aagcaattgt gaatgggagt tcactcatga 
20 ataagaatgc ttgtgatttt tgtacattga 
ttatcagctt aaggagattt tgggctgagt 
catctgcaaa cagggacaat ttgacttcct 

25 

tctcctgcct aattgccctg gccagaactt 
gagggcatcc ctgtcttgtg ccagttttca 
30 gtaagatatt ggctgtgggt atgtcataga 
caatacctaa tttattgaga gattttagca 
ttctgcatct gttgagataa tcatgtggtt 

35 

aaaaaaaacc ctgtcaaaaa gtgggcgaag 
atttatgcag ccaaaaaaca catgaaaaaa 
40 caaatcaaaa ccacaatgag ataccatctc 
tcaggaaaca acaggtgctg gagaggatgt 
tgggactgta aactagttca accattgtgg 

45 

gaactagaaa taccatttga cccagccatc 
aaatcatgct gctataaaga cacatgcata 
50 gcaaagactt ggaaccaacc caaatgtcca 
cacatataca ccatggaata ctatgcagcc 
gggacatgga tgaaattgga aatcatcatt 

55 

aaacaccgca tattctcact cataggtggg 
aaggggaact tcacactctg gggactgttg 
60 taggagatat acctaatgct aaatgacgag 
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tgtagatatg cggcgttatt tctgagggct 1080 

tttggtacca gtaccatgct gttttggtta 1140 

gtagcatgat gcctccagct ttgttctttt 1200 

ttttttggtg ccatatgaac tttaaagtag 1260 

ataacttgat ggggatggca ttgaatctat 1320 

cgatattgat tcttcctacc catgagcatg 1380 

ttatttcatt gagcagtggt ttgtagttct 1440 

gttggattcc taggtatttt attctctttg 1500 

tttggctctc tgtttgtctg ttattggtgt 1560 

ttttgtatcc tgagactttg ctgaagttgc 1620 

caacagggtt ttctagatat acaatcatgt 1680 

cttttcctaa ttgaataccc tttatgtcct 1740 

ccaacactat gttgaatagg agtggtgaga 1800 

aagggaatgc ttccaatttt tgcccattca 1860 

tagctcttat tattttgaga tatgtcccat 1920 

tgaaggttgt tgaattttgt caaaggcctt 1980 

tttgtctttg gttatatcaa atttacaaga 2040 

gacatgaaca gacacttctt aaaagaagac 2100 

tgctcaccat cactggccat cagagaaatg 2160 

acaccagtta gaatggcaat cattaaaaag 2220 

ggagaaatag gaacactttt acactgttgg 2 2 80 

aagtcagtgt ggcgattcct cagggatcta 2340 

ccattactgg gtatataccc aaaggactat 2400 

cgtatgttta ttgtggcact attcacaata 2460 

acaatgatag actggattaa gaaaatgtgg 2520 

ataaaaaatg atgagttcat gtcctttgta 2580 

ctcagtaaac tattgcaaga acaaaaaacc 264 0 

aatggaacaa tgagaacaca tggacacagg 2700 

tggggtgggg ggagggggga gggatagcat 2760 

ttaatgggtg cagcacacca gcatggcaca 2820 



-104- 



WO 02/062946 



PCT/US01/48437 



tgtatacata tgtaactaat ctgcacattg 
taataataaa ataaaaataa agtctctctt 
5 tagaactcaa tggttacctt tcgtgaggag 
agctgtaata tttcacatga agcccatctt 
aatcccatta aaatggctgt caaaaatgaa 

10 

ttacacccta aaataaagta cataggccac 
aatgcatttg ttttttccta tcttatttcc 
15 tacccagtag ttctacaatc tggcttcaca 
tttgtgttgg gctgagtgca gtggctcatg 
gcgcgtggat tacttgaggc caggagttca 

20 

gtctctacta aaaaaaatag aaaaattaac 
cctactcagg aggctgaggc acaagaatcg 
25 gccgagattg tgccatggca ctccagcctg 
aaaaaaaagt tcaccttcat gttgggttca 
acttcccaga agattccact aggtagttga 

30 

ctaggctatt tccatgttta ccaataaatg 
cctctaaaac aacagtgcct tcattcaaat 
35 cgtttatttg aaaacacagc atgaggtcaa 
tgttggtttt attttatttg atggctcatt 
gtccgtctta atcttttatt aatcttacaa 

40 

ttcaacttgt gacttttgca acattataaa 
gcaccccaaa gtcaaaaacc ttaggtggtt 
45 gatatataaa ggtactgctt catgaagaat 
agtgtgtatg ggggggtaaa acagacagtg 
ccagcaagtc ttgagaaata tattgctttt 

50 

ctagactata ggtgaattca ctcgaagatt 
gttttgaact tgattattct ttaaggtatt 
55 gatcatgtct ccttgccatt gggaaatatt 
ttctattttt ccctttttaa tttgtttaaa 
cagccataaa aaagaataag atccagtcat 

60 

ttctgtcaag tgcaataagc caggcacaga 
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3000 
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3060 
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ayawvaycui. 


yy LCdatduy 
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tgggcatggt 


ggtgcgcccc 


LyLddCCCCd 




ccugaacc eg 


99 a 99 ca 9 a 9 


gcugcaguga 




ggcaacagag 


LdayaCcytc 


LCdddddddd 


■5£ftft 
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CCCLtCCayC 
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yCUddydtLy 
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CCaLLLLCCL 
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L Id Ldy LaaL 
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ddy Ly LLLLd 
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dy ay LLLdLd 


LdLyLy LLdd 




nf-na f- 4- t-aa 
y LyaLLLLda 


hahrtharapa 
LdLy LdLdLd 


LdLyyyLdLy 


a 9 nn 


csaacaataa 


tatttaataa 

l- d t_ i_ i_- ay a a 


atacrataaora 


4260 


Lay H-daaLy 


nnnr , a t~ rrt~ 
yyyL-aw>Lwy L 


a aa hflaaf art 
aaa LyddLay 


*± -5 U 




L L LyydddL L 


Ly L LdL l Ly d 


a ^ ft n 


traacartaa 
uv»ciy wflu i#ya 


aaactaatca 
aay l. Lay l l a 




4440 


gcatgcaagt 


ggaaaaaagg 


tctatgaatc 


4500 


aaatttttta 


atttatttat 


ttttatttat 


4560 


ttgcaacaac 


atgaatggaa 


ctggagatca 


4620 


aaaacaaaca 


tcacatgttc 


tcacttgtgg 


4680 
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gatctaaaaa ccaaaacaat tgaactcatg 
ggctgggaag ggtagtgggg gcctgggagg 

5 

aatagaataa ataagaccca ctatttgata 
tacattttta agataaagag tgtaattgga 
10 ggggatggat accccattct tcatgatgtg 
tgtctcatgt acctcataag tatatacact 
gccgggcgtg gtggctcatg cctgtaatcc 

15 

cacgaggtca agagatcgag acaatcctgc 
aatacaaaaa ttagctgggc gttgtggtgt 
20 gaggcaggag aattgcttga acccggaggc 
tgcactccag cctggtgaca gagtgagact 
ataattttta tgtatttatt ttactgttcc 

25 

acatatgttc tgcagatcta atgaacagca 
cttgaagttt gttaagagag gagatcttaa 
30 aaagatgatt atgtgatctg atgtgatcag 
caatgcacac atgtgtcaaa acatgttgta 
aatcttcctt tgaaagcagt ttctcacaat 

35 

atcattttta ttatttactt tgaactcatg 
gcattctgtc atttatactt tataccccaa 
40 gggagaaccg aagtcttaga gtttaagtaa 
aattcaaaaa gagcatgttg aaagcatcca 
ctcttcttcg ctgaaggcat tttcttgtca 

45 

tggttctgga taaaacgggg agggaaggat 
tccttctgac ttctgtctct gtctctgtct 
50 aaatgtagac cctggagagc cgctgcccta 
cacctgtgca gaaattttct ggcaactggg 
cccacacact atgttttggt cttggtaact 

55 

cccactgacg ctcctgtttg taggactagg 
tctgtctcac agactcccct ggggcatgga 
60 ggcctggccc tgtgaccttc ataaggattg 
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aagacagaaa 


gtagaaggat 


ggttaccaga 


4740 


ggaggtgggg 


atggttaatg 


ggtacaaaaa 


4800 


gcaaaacagg 


gtgactgtag 


tcaataactg 


4860 


ttgtttgcaa 


tgcaatggat 


aactgcttaa 


4920 


agtattatgc 


atttcatgcc 


tgtatcaaaa 


4980 


aggtacttac 


aaaaatggaa 


aaaatagggg 


5040 


cagcactttg 


ggaggccgag 


gcaggtggat 


5100 


ccaacgtggt 


gaaaccccat 


ctctactgaa 


5160 


gtgcctgtag 


tcccagctac 


tcgggagtct 


5220 


ggagcttgca 


gtgagctgag 


atcgcgccac 


5280 


ctgtctcaaa 


taaatacata 


aaaaataaaa 


5340 


cccctcagct 


ttaatgaggt 


acagtcagat 


5400 


tgataactga 


agttaatact 


gtactatgta 


5460 


atgttctcac 


caccaccacc 


aaaaaaaaaa 


5520 


tagattatct 


gctttgtgtt 


agtgacttca 


5580 


catggtaagt 


atattaaatt 


tttatttgtg 


5640 


cattctctaa 


ggaattgaag 


aaagagctca 


5700 


tgaagatctt 


agttattaag 


acctttacat 


5760 


agtgggcact 


atgatcttca 


acttacaaat 


5820 


cttcctttca 


tggctttttc 


tacagctcca 


5880 


ctctttcctt 


tcatttcttc 


agatttcttt 


5940 


tctctttctt 


cttcagtatt 


ccatttgcaa 


6000 


cttatcttta 


ccaccttcct 


tttggctatt 


6060 


tctctttcat 


cagtcccttc 


cttcgctcct 


6120 


gggctcaggt 


gccagctctc 


ttagcagaag 


6180 


cccctgctca 


tttcatattc 


acaatctctg 


6240 


ggccctcttt 


ctctttcctg 


tgatcacgtc 


6300 


acctcgcttt 


ccttgctgtg 


aacattcttc 


6360 


tcaggggctg 


tgccctcatt 


ctttctctca 


6420 


tggtttgaat 


cttcacgtta 


cctacttcct 


6480 
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tactccttgt cacttggctc tgatctttct 
aagcatataa tttggccact tttgaaactg 
5 gtaatcccag cactttggga ggctgaggca 
ccagcatgac caatacggtg aaaccctgtc 
tggtggtgca tgcctgtagt tgcagctatt 

10 

cccgggaggc ggaggttgca gtaagccaag 
agagtgagac gccatctcaa taaacaaacg 
15 aaaaccaaaa aacaaaaaag aaactgaaag 
aggcacactt cgggcttcac cattgtacaa 
taaattttag gaaagctggg taacatttct 

20 

gttaatagta gtacctgggt cctgggttat 
agcattcaga acaatgacct gtatgtagta 
25 attattatta tatctgtgat attgggcaaa 
gaatgtcatc acgcatataa ggtacatagc 
gcctgagtga ccacctccct cctccagatg 

30 

tgctatgcac ctaacactgt tctacctcca 
tattgttgct tgtgtttcct gttccagcaa 
35 cagttatccc tcacaccagt ttcctaatca 
agcgaggccc caaatgcaga tgtgattagc 
caccaagtcc tctaaagtgt tgcataaggc 

40 

gttatcggga aaagatgctg tagtgatctt 
gacttgggag gaaagccgtc tgccaaagcc 
45 atagacattc ccctttcttt tatatctttc 
atgtctgcaa aggtctctgt ttggttgcta 
atggcttctc tttaggatgt gatgattcct 

50 

ataatacagc aatttctttg ggaaaatgtg 
tcttaggtac attaaatccc actggctgca 
55 ggggtggtct cgatatgcct gaagtatggg 
gggacagtgg agtgtaatga tagcttgagg 
tgtgtgggct tgggaaagct acttaacttc 

60 

ggaaaggatc atagcagtag tacccattca 
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attgcctatt ttaggacatt tatttaggat 6540 

aaaggaggcc gggcgcgatg gctcatgcct 6600 

ggtgaatcac cggaggtcag gagttcgaga 6660 

tctactaaaa atacgaaaat tagctgggag 6720 

caggaggctg gaacaggaga attgcttgaa 67 80 

attgcaccat tgcattccag cctggatgac 6840 

aaacaaacaa acaaatacaa aaaacctaaa 6900 

9 a 9999tttt aacaatcata ttaggataac 6960 

aacagagatt ttgcacaggg cttttccagt 7020 

ataccatagt tttctctacc ataacatcgg 7080 

tgtgaggatg agatgaatta acatatgtaa 714 0 

accatttaat acatttaaac aataattagc 7200 

gtaattaacc tctctaagca tgtaagaatt 7260 

accgttgctg gcacatcatg taaaataagt 7320 

cttctcttac agtgtgactt tgtctcttct 7380 

acccacaacc tcttcccact tgtctaccca 7440 

tgtgttaagc aacatctgaa agccagaaga 7500 

tcactaagcc ctggggaaac acaatggatc 7560 

acactggtca gtcagcccgc aaaacacttt 762 0 

aaagataacg ttaagtctgt ggaggcttcc 7680 

ttctgagtgt ctcctacttg cgacaaggtg 774 0 

tgaagcctcc aaggtataga ttccctctcc 7800 

gttaaaatgc ctcccagtgg actactgggg 7860 

tgtagaggtt tggaacattt aactgtgaac 792 0 

ttggagccat gataaaaagt gttagacata 7980 

ggtatggata cacacacatg cacacgcacc 8040 

cctgagtttg tctgccagct ctgataatga 8100 

gtctgcaaag tgcattttag catagctcag 8160 

ttctgatttt tagcttctca aattgctggc 822 0 

tttgttcctc agcttcttca tctgttagat 8280 

tctgatgttg tgagaactaa attagtaaac 834 0 
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atggataaag 


cccttagaac 


aaatgcctgc 


cacatactaa 


gagctggaga 


gatgaactta 


8400 


5 


actggcagtt 


tcctagaaaa 


tctacatttc 


tctgattttg 


tgtgtgtgtg 


tgtgtgtgag 


8460 


agagagacag 


agtctcgctc 


tgtcacccag 


gctggagtgc 


aatggcacga 


tttcggctca 


8520 




ctgcaaactc 


cgcctcatgg 


gttcaaacaa 


ttctcctgcc 


tcagcctcct 


gagtagctgg 


8580 


10 


gattacaggc 


gcgtgctgtc 


acaaccagct 


aattttatgt 


gtttttagta 


gaaacagggt 


8640 




ttcaccatgt 


tagcaaggat 


gatctcaatc 


tcctgacctc 


gagatcctcc 


cgcctctgcc 


8700 


15 


tcccaaagtg 


ctgggattac 


aggcatgagc 


cactgcaccc 


ggaccatttc 


tctgattttt 


8760 


ttaaaaataa 


ataactgatt 


taaccaaatt 


ttagaaaaca 


tagtttgttt 


cacctggagg 


8820 




ctgaaaattc 


tccctgatta 


ttttgattat 


ttgaaacaaa 


tcaagttaaa 


taatatggtt 


8880 


20 


agatatttgc 


tctcttcaga 


aatgctcttg 


gagtttcttg 


agatgagagg 


ctccagtgct 


8940 




tgcagaggta 


ttttccatca 


tcatttatca 


ggatggctac 


cttgggctgg 


accaaaagtc 


9000 


25 


tggttactga 


ttgaacatgt 


gtgttaactg 


agtgcttgcc 


atctgatgat 


tctgagctag 


9060 


gcactggagt 


tgcagtgatg 


agtagacaac 


gaggtaacag 


acaagtatcc 


aggcaattaa 


9120 




agctcaggct 


gattaagtgg 


tatgagaagg 


aaagttgagg 


aaattttgag 


agcaacccaa 


9180 


30 


aggggtgcat 


gcttcttcca 


gagttggcca 


ggactaaaaa 


cagcatcctg 


gaagtatgac 


9240 




atgtgggctg 


aaaatcagtc 


tggaagggcc 


gggcgcattg 


actcacgcct 


gtaatcccag 


9300 


35 


cactttggga 


ggctgaggtg 


ggtggatcac 


ctaggtcaag 


agttcgagac 


cagcctggcc 


9360 


aacatggtaa 


aaccccatct 


ctactaaaag 


tacaaaaatt 


agctgggtgt 


ggtggtgcac 


9420 




gcctgtaatc 


ccagctactt 


gggaggctga 


ggcgggatca 


tagcttgagc 


ccaggaggcg 


9480 


40 


gaggttgcag 


tgagctgaga 


tcgggcctct 


gcactccagc 


ctgggagaca 


gagcaaagct 


9540 




ccaactcaaa 


aaaaaaaaaa 


annnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


9600 


45 


nnnnnnnnnn 


nnnnnnnnnn 


nttcagtctg 


gaaggagagc 


agaaattatc 


caggtgaagc 


9660 


ttgaggggac 


ttgtctttca 


ggaatggggg 


atagcacaga 


ggattgaaga 


catttgaggg 


9720 




tttagaactt 


ctgaaggaag 


acaaatcaag 


aaaataaggc 


tggatggagt 


tttgagatag 


9780 


50 


ggatgggtga 


ggggtgaggc 


aggagacgta 


ggcgagagat 


tatctggtca 


taaagcagca 


9840 




cattctccgt 


attgcatgca 


tcttagttca 


ttagatattt 


aatccaaaat 


acacacaaat 


9900 


55 


ttcccagtgg 


ataaatttga 


agtcttggct 


ttttggctat 


tcttatttat 


actttttgca 


9960 


gataattacg 


ttaggaactt 


attcacatag 


agtttgcttt 


aagctttatt 


gatcttccct 


10020 




cgctcttggg 


ttctgtacag 


tgttgaacac 


ggggggttaa 


gggttccttg 


tcatttcaca 


10080 


60 


tttatccata 


cttaaataca 


tacagaatgt 


aatttaatct 


tggcatgaaa 


atcaatattg 


10140 
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gtaggaaact ttgatgaaga gtattggtag 
ttatcagtat tggtagagaa ttttatgaat 
5 acatggcttg actctcactt ttagtcaatg 
gtcatacaca cccccagagt agagtatcag 
ttagcttgac aaaaaattaa gttattagaa 

10 

gagtagaaat taagatacta tactttcttt 
gggtacatgt acaggttcgt tatataggta 
15 ttgtcaccca ggtactaagc ctagtactca 
ctctctgata atctccagtt gttttgttcc 
agctcccact tataagtgag aatatgtggt 

20 

taaggataat ggcttctact tcatccatgt 
tatggctgca tagtattcca cagtgtgtat 
25 tgatggacat ttaggttgat tctatgtctt 
tcacgtgcat gtgttttatg gtagaatgat 
gatagttatg gcagaataca gcatcatttt 

30 

tctccctcat cagttataaa agcatataag 
ctatgaatta ggaagttatc ttgcccaaca 
35 tactttagtg atacatattt ttcagctgta 
tatatataca aacttgaagt cctttttgtt 
ccacctgttc acaggttgag aattccgtac 

40 

ataaaattaa aattattccc atttttacag 
catgcaacta gtaagtgatg aaatctggat 
45 tctgaaaact gaggttccca ttactaggtc 
tgaatttgga tcttccatag tgcttaaact 
gtcttctata tagtactata taatgtggtg 

50 

tgtttttagt aaaatctgaa atgatttata 
ctctctatac ctcaattttt ttcatctgtg 
55 attaagtata ttaatacatt cacagcatcc 
taaataaatg acacctacta ttattactgg 
agaccactgc tcagagaaac catatgtctg 

60 

ggactatata tcagtgcatg caaaatgaga 
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caaaatttta 


tgacatttat 


aataaatgtt 


10200 


ttttatagca 


attcccagtg 


taataataaa 


10260 


cttagtaaat 


ttgtcatcac 


acttaaatga 


10320 


aaaggtaaac 


atcctgaaag 


tggcgtgtat 


10380 


aaatggatct 


agtatataat 


ttgtaacaaa 


10440 


ttcaataaaa 


agcttttatt 


ttaggtttac 


10500 


aattcgtgcc 


acaggggtgt 


gttgtgcaga 


10560 


atagttattt 


ttcccgatct 


cctccctcct 


10620 


cctctatgtg 


tccatttgtt 


cttagcattt 


10680 


atttggtttt 


ctgttcctgt 


attagctttc 


10740 


tcctgcaaaa 


gacatgatct 


cattcctttt 


10800 


gtaccacatt 


ttcttcatcc 


agtctacaaa 


10860 


tgctattgtg 


aatagtgctg 


caatgaacat 


10920 


ttatattaag 


atcttattct 


taacccaatc 


10980 


atgtgtttgt 


ttatggttca 


tctcatgcca 


11040 


cattttacaa 


ctattttgtg 


taggagggta 


11100 


ttcagaagtc 


caatgtaagc 


attctgtgtg 


11160 


atgattgagg 


tggttgtttt 


gtgaccctta 


11220 


agaaggcaga 


gtataagtta 


aaagctgata 


11280 


attatcaaag 


tcaactctcg 


caatgttaat 


11340 


aagtgaaaac 


tgaggttccc 


actacaaggt 


11400 


ttgaattgga 


atcttccata 


gtgcttaaac 


11460 


atgcaactag 


taaatgatga 


aatctggatt 


11520 


ctaccatgct 


atgttggtac 


tcagaaagac 


11580 


atttgaaaaa 


tattggacct 


tagagtagaa 


11640 


agccatatga 


gtttgggtaa 


gtttctcggt 


11700 


aaacgggaat 


aataataata 


ttataacagg 


11760 


agacagtgct 


tggcatatag 


taataagtac 


11820 


aatgtgacta 


aagccattat 


tcttataaag 


11880 


gttgcccttg 


actgacttga 


agaaaggcag 


11940 


aagaaattgc 


tctttaccat 


tccaaagtta 


12000 
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agtgaagcac caaggcctac tacaatgacc 
gaattgaata ttgtacaaac tctgttactc 

5 

gtaatgtatg gacagaattg aacccttctg 
aaagggaaag catgaaggct ttggatttct 
10 gcctcccacg aagttgataa tgcagagctg 
agtgagacgg gaccagaaga gctgaatact 
gattatcaga aagcaaaatt acaagttctt 

15 

ttaagaggga agaaaataaa tacagttgac 
attcccttac atgtggattt cttccacctc 
20 ccctgctctc cctcttcttc ctcagtctat 
ttatgataat ctacttccac ttaatgaata 
cttaacgttt ttagtttctc tagcttattg 

25 

taatcgactg tttatattaa tagtaagact 
gtttttggga agtcataagt tttacgtgga 
30 accccaatgt tgttcaagag ccaactgtat 
gaggaaattc tgagttattt atcactgact 
cttatagatc tggggagcag aaatgcaaag 

35 

ttggagaaca gattccaaga cccttcattt 
aggaggctga gctctatgaa gcttatattc 
40 gaagggttaa aaattgggtt gagagaaagg 
ccttcatcat attatgatgc aaaagcatgc 
ccaatcatgt ctgttttact gtcctgccac 

45 

aaatgtgggg gagaggtgaa gacaaaatgc 
ataaaaaact ctttcagata agctctcgtt 
50 ttcaagcacc gacatctcct ttctgaggaa 
ttagtatatg actgttacgc ctttttcccc 
tgcagcaatg attctggctt tgggtgtctt 

55 

ccaaaagcac ttctttttct tcaccttcta 
tgtgagtatt cgtactcccc tggctatatg 
60 gtttgatgac aaaaatattg tttgtaggtt 
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cgatgatgtc tactactcca 


aggcaagaca 


12060 


tgcctggtgt cccatttgtg 


gagagtcatt 


12120 


taacactcaa aattttagta gctagtggaa 


12180 


tttctatcac agccataaac 


aaccccaatg 


12240 


gggtcagcct ctgcccatgg 


taccccaggc 


12300 


tctgtctacc accccataaa 


tggatcacca 


12360 


gggtaagtca gccttagttt 


aaacactgat 


12420 


ctgtgaataa cgtgcgtttg 


aactgtgagg 


12480 


tgtttgccat cccgagacag 


caagaacaac 


12540 


tcaacgtgaa gaggatgagg atgaagacct 


12600 


gtaaatacat tttctctttc 


ttatgatttc 


12660 


tatataatat gtcacataca 


aaatatgtga 


12720 


tctggtcaac agtaggctat 


tagtagttaa 


12780 


tttaccacca tgaaagggtg 


ggcattccaa 


12840 


catgtagaaa accttgctta 


tgctgagaag 


12900 


taataatgat tggagaaaag acagggactt 


12960 


gcattttctc atggtagaaa 


acatcaaacc 


13020 


tttgttttgt gccttttgac 


cagccatttg 


13080 


tatattttaa atttaaaaat 


tgggttgaga 


13140 


ctgagtagga tatggcttat 


cctctgtctg 


13200 


tgggaagaga ctttgactcc 


atctacctta 


13260 


tttctgagag gagttcgtga gaatgaacag 


13320 


tgccactgga attggtgtgg 


atagcgtgga 


13380 


ctgttaattt aggcttgaca 


ctttaaccat 


13440 


aattaaaatg gcctgttttc 


agtgtccact 


13500 


tttcttcttt caggccatcc 


agatcctgaa 


13560 


tctgggttcc ttgcaatacc 


cataccactt 


13620 


cacaggctac ccgatttggg gtgctgtgtt 


13680 


ttatttctta gataccactg 


ctggagatgt 


13740 


attggaaggt tgcatgctgt 


ggttttgatt 


13800 
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tgcaattctt gatataaaca attagagacg 
aattacttaa ttaatttatt ttttgagaca 
5 cagtggcatg atctctgccc actgaaaccc 
cctcagtctc ctgagtggct gggattacag 
tttttgtatt tttagtagag acagggattc 

10 

tgaccacaag tggtcctccc agcttggcct 
cctgatgcca ggccaaaaat ctctttctga 
15 tgaagtccta aaaattatct tatatataac 
ttgaatgaaa aatgggtatt gatgaattgc 
gagttaagaa atttgtctgc tgattactgt 

20 

tgggtcaaca ttcaatcaga agaaataggg 
attatttatt acttccctga tggttcattt 
25 tttgtatatc ttcttttgag aattgtctat 
tgtttgttta tttcttggtg atttgtttgg 
tgttagatgt atagattgtg aagattttct 

30 

tgactgttcc ttttgccgtg caaaagctcc 
ttgattttat tgcatttgtt tttggcttct 
35 cctaagtgaa gtaactcagg agtggaaaat 
gagctaagct atgaggatgc aaaggcataa 
gggaaaggtg ggaaggaggt gagggataaa 

40 

ttgggtgatg ggtgcaccaa aatctcacaa 
aaacaccacc tgttccccaa taacgtatgg 
45 taaaactaat tgcatggaaa gactaaataa 
tgagcagcat caagtacaat tctgtgtcct 
gactattgct ggatatctgc taattgaggg 

50 

tattccttta tctgttttcc ttcctatttt 
tgttgtagca gggataaaac ccacaagaac 
55 gatcagagga gtagaaatat attattcata 
aactgagaca ctgagtatca tttaatgcac 
ggtaggtcag cagatcacat acattgtgag 

60 

ttggtgaact tgaagcttag aaaaagtctt 
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ataaactctt 


ttttattttt 


taatttaatt 


13860 


aagtctcgat 


ctgccatcca ggctggagtg 


13920 


tctgcctccc 


cagttcaagt 


gattctcttg 


13980 


gcatgcacca 


ccatgcccac 


ctaatttttg 


14040 


accatgtttt 


ccaggatggt 


ctcgaactcc 


14100 


cccaaaatgc 


tggaattgee 


aggcatgagc 


14160 


ttaagagaaa 


ccacatttgc 


caagtacaaa 


14220 


tatatgttac 


atttatagat 


acttaaaaaa 


14280 


cttgtggtac 


agcagagagt 


cctggcttga 


14340 


ctgaaagtca 


gaagaatcat 


ttccattatc 


14400 


gtgattttaa 


aatgtattca 


ttcagttttg 


14460 


ttaaaaagtg 


aatgatatgt 


ttgttggcca 


14520 


tcatgtcctt 


ageccatttt 


ttgatgggat 


14580 


gttcattgca 


gattctggat 


attagtcctt 


14640 


cccactctat 


gggttgtctg 


tttactctgc 


14700 


ttagtttaat 


taggtcccag 


ctatttatct 


14760 


tcgtcattaa 


atecttgeat 


aagecattat 


14820 


caaacatcat 


atgttctcac 


ttataagegg 


14880 


gaatgacaca 


atggactttg gggactcagg 


14940 


aggctacaaa 


ttgtgtgcag 


tgtatactgt 


15000 


attaccacta 


aagaacttat 


ccatgtaacc 


15060 


aaataaaaaa 


aatttaaaaa 


ataaaatgaa 


15120 


ataaataaat 


aaataaataa 


ataaataaag 


15180 


aggatgtgaa 


tttaaaaggg 


aggtttgtat 


15240 


aggaccaaga 


agaaaatggt 


tgagacttaa 


15300 


caaacagttc 


tgtagttcag gaaccttgtc 


15360 


atgggtaagt 


agcacttcct 


ctttttctat 


15420 


aarflctfa era a 


cagtaagtgt 


ccatgaatac 


15480 


aaaatattag 


ttttcttaac gtgttcagat 


15540 


gatgeattec 


tgggatgett 


gctcttaata 


15600 


cttgattttt 


atgtgttgtc 


aagtttgacc 


15660 
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aattctaatt 


atttggaagt 


aagaactccc 


5 


aaacttctat 


tctaaacaac 


tgctaagtct 




cactgatact 


ccactttcct 


actccattca 




ttaccatttt 


attgtatcct 


caaacaccaa 


10 


atgatttttc 


tgtctcttct 


gagaaaactg 




actactaatg 


gactagatct 


gtaccaatgt 


15 


aaattttgtg 


attttcttta 


aagctgatct 




tttccccaca 


caagtgcatc 


actttaattt 




gtctcactct 


gtcacccaga 


tgggagtgca 


20 


accttccagg 


gctcatgcaa 


tcctcccacc 




gtgtgccacc 


atccctggct 


aatttttgtc 


25 


tcttgccatg 


ttgcccaggc 


tggtctcaaa 




gcctcccaaa 


aggctggggt 


tacaggcgtg 




taaacatcca 


gttactgttc 


tcaacttaaa 


30 


ccttgacatt 


atttctccct 


gctagggaca 




aactccttca 


aaggaatcat 


catgttcttt 


35 


acccacttca 


ataagacttt 


attttattcc 




gatcactgaa 


aacttccacc 


ctgctaaaac 




aagctatgag 


tagcatttag 


catagttagt 


40 


cttggcttcc 


agcatacatt 


gcttggctgg 




cagtctcctt 


tcctggttcc 


tctccttgtc 


45 


gctgaattct 


ttgccacctg 


ctcttctcaa 


attcttatga 


cttcaagtaa 


tatctgtatg 




caggtgtctc 


tcctgatcat 


cagactggca 


50 


tcttcctcaa 


accttctcct 


catgtcagct 




accaaacacc 


ttagaataac 


caagagtctt 


55 


cagcaaataa 


tgttggctct 


actttcaagc 




atctttctat 


tgatgacatt 


tcatccaatc 




ttttatcgtt 


gaacttctct 


ttgccttgct 


60 


caagtatatc 


atttatttgc 


tccaaatcct 



accactaact 


agtttgacaa 


ctgaaagtcc 


15720 


ctatgatgtg 


ttcttagtga 


gtcacgtagc 


15780 


ttcactttcc 


ctcctccatg 


gatagctatt 


15840 


tcccacttct 


gtcatcctta 


ctctctgctg 


15900 


aagccattag 


aagggaattt 


tcagacctcc 


15960 


atttttttct 


tctagcctct 


taccatagga 


16020 


ccctacttga 


gtataagatt 


ccattctttt 


16080 


ttgtctctct 


gctatatata 


tatttttcga 


16140 


gtggtgtggt 


cacagctcac 


tacagcctcc 


16200 


tcagcctctc 


acatagctgg 


gactacaggt 


16260 


tttttttttt 


ttttttttgt 


agagacagga 


16320 


atcttgggct 


caagcagtcc 


tcctgcctca 


16380 


agttaccttg 


ccccatctct 


aatcagcatg 


16440 


aacaaaacaa 


aacaaaacaa 


aacaagaaaa 


16500 


ggcacaattt 


gtattttcct 


tctgtagcta 


16560 


ctctgatttc 


catctaccgg 


tttgaatcaa 


16620 


ctactatcct 


tctgtaactg 


ctgtctttaa 


16680 


aacggtcaac 


tcttgttcct 


catctaactt 


16740 


cgcccactcc 


tccttccagg 


aatttcctca 


16800 


ttttcctccc 


atcttttgct 


ttctccatct 


16860 


tctgaccttt 


taatgttctg 


gtgccccaga 


16920 


tctatacgta 


cttcttctgt 


gatctcacac 


16980 


cccacaaccc 


tgaaataagt 


atttccagcc 


17040 


tgtaaagatc 


caaagctcaa 


ctcccagtct 


17100 


gatgggaatg 


tcatccttcc 


agattcccag 


17160 


ctttctttca 


caatcctgaa 


tcctatttat 


17220 


tattattata 


tccagttttg 


accttttctt 


17280 


caccttgtct 


ctcctaattg 


ggcttctagc 


17340 


agaaacagtg 


atcccttgaa 


accataaagc 


17400 


tcattggctt 


tgtatttttc 


ttagagttag 


17460 
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aaacaagttc cttttcatga tctgcagagc 
cttcttcttc atttcctagt attgactacg 
5 cccagaagat tctggcctca ggattttgag 
gatatcctcc tggctggcac ccacagctct 
taatatcatg aacccccact cctctctttg 

10 

tagcccctag tacttaatca ctttcaaaaa 
ttttttcttt gtttttattg ctagaatata 
15 ttctttggtc attgatgtta gcctcgcatc 
caatcaatgt ttgtttcaat gaattgttgc 
aattgaaaat tcttattctt ttttagatac 

20 

ctacaattgc actagtgggg actgcttttc 
cattaaggag ttgtcactct tcatcagagt 
25 tatcaaatgt atgtttctga aaaatatgta 
tgtacagaaa aataatatat caggtaccaa 
ttttttgaga cggagtctcg ctctgtcgcc 

30 

tcactgcaag ctccgccctc ctgggttcac 
nnnnnnnnnn nmmnntttt tttttgtatg 
35 ccaggatggt ctcgatctcc tgacctcgtg 
ggattacagg catgagccac cgcgcccagc 
tcaggtactc tacctcttct gaactctact 

40 

catttcatcc cagtgatcta tgaactgtta 
tctttttatt attattatta tacttgaagt 
45 tgttacatat gtatacatgt gtcgtgtttg 
ttaggtattt ctgttaatgc tatccctccc 
gtgtgatgtt ccccgccttg cgtccaagtg 

50 

agacactact tatggaagtt tgtgttctca 
tttatatttg tcggagcctg atttttctgt 
55 tattgtattg gttgcttcta ggtgcctcag 
tttatttttt gtttatatta gctcagtgtt 
acccctatga agtgctaatg aaattgtgga 

60 

cctactcctc tccttagttt actactattc 
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cccacatgat 


ttggctctct 


gttaactttc 


17520 


tgcctccaac 


gcccttgacc 


cctttgctgc 


17580 


tgtttcccta 


atttcaagga 


tattccccca 


17640 


ttcatgcctc 


tccccagtgt 


catttctatt 


17700 


gattcccaga 


ttctccttat 


cccattctgt 


17760 


tattatgtaa 


tttacctata 


gtatgtttac 


17820 


agctccatgg 


agggtggggg 


acagaaatct 


17880 


taggatatca 


gtgggtgtgt 


aaacaatatt 


17940 


caaaggaatg 


aagtacgtgc 


tgcatatatt 


18000 


agaacagttt 


tggaatgaac 


attgccagtg 


18060 


tctcactaaa 


tatagcagtt 


aatatccagt 


18120 


caccggacct 


atgcaattac 


atgggctcca 


18180 


taagtataaa 


atatttcaaa 


caatccagga 


18240 


tgtgccataa 


tttgaatctt 


tttttttttt 


18300 


caggctggag 


tacagtggag 


tgatctcggc 


18360 


agcattctcc 


tgcctcagcc 


tcnnnnnnnn 


18420 


tttagtagag 


acggggtttc 


actgtgttag 


18480 


atccgcccgc 


ctctgcctcc 


caaagtgctg 


18540 


caaccttacc 


aattttttaa 


aaatttgtgt 


18600 


tagaaattaa 


tattttcctt 


tggaattatc 


18660 


attacagact 


cactcctaag 


cttcctgttt 


18720 


tctagggtac 


acgtgcacaa 


cgttcaggtt 


18780 


tttgctgcat 


ctattaacat 


gtcatttaca 


18840 


ccagcccccc 


accccacgac 


aggccctggt 


18900 


ttctcatagt 


tcagttccca 


cctatgggtg 


18960 


ttttctttat 


tacattttta 


atgatctaaa 


19020 


gaatttggag 


gtatttttcc 


tccagggata 


19080 


taataatagt 


agaatcatta 


gcccaacaat 


19140 


ttagtttcta 


gggataatgt 


aaatgcctaa 


19200 


agggccattt 


ttgttcattt 


ttctaaactg 


19260 


agattggaag 


atgttagtga 


taatgtcaga 


19320 
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attttgtttc tacagcttct tccccctgta 
gacacatctt tctcattaga gaaaatgttt 

5 

ctacagtgtt atatgtgtct ctgtctttaa 
agggagatgt tgtagtctgg tttcctgctg 
10 actgggtttg atataaggca tcatatcccc 
accttgctgg aattatgcgt aactatctct 
tgtaattcaa gagagtgaga tttttcaaat 

15 

cctgtagggt ttgaagttgt ttctgatgct 
gtggagttcc tggaattaga gattgtatga 
20 aatttggatg tgaacatgca agaggaaata 
tatacagctg taattaagtt ttattatttt 
cctaggaaaa attaataatt ctttctaaat 

25 

atattttcat tctcttctaa tgtttgttgt 
tggcacagta aacatcaatg aatggatttt 
30 ggatgggagg agaaggagat aatgatcctg 
aggaaatttc ctcacctccc aattctgtgt 
ctgagagcat gaatatttga ccttaaatct 

35 

aggagaacat aagcctgctc gtaaagctca 
ccttggacgt ttgactgact ctatcctttc 
40 gtgggtatgg aaggacagat atatttcttt 
atcattactc caaagttgtt tccagaaatt 
ttgctttatg aggtttaagg aaggaaggcg 

45 

aaatccaact tctgaccgcc cagtaggaag 
aatgcttctt gactttaaca tcagcattat 
50 tatcattaaa ataaattttc actgtatttg 
tttcagtgaa ctgctggaac tcacacatgc 
gagtctgaga gcaagcccaa atgtgttctt 

55 

aactaaagaa taaggccttt agaatctgac 
ttattacctg tatgaacatg ggcaaattat 
60 gtaaaatagg tgtaataata acaactactt 



gctacttcac atgcgttgac atagaaatgt 19380 

tcatgctttt agattgagat actatctctt 19440 

tttatggttc actttgttcc actttacggg 19500 

acattttttt ttttacctct ggatgttgcc 19560 

tagggcatgg tgtctctact gctgattctc 19620 

accatagcca tgtggtgcaa tgcaaactgc 19680 

gattaatcat tcacatgatc tcaaagcctc 1974 0 

tagaaataaa cctttatgtg gtaatgattt 19800 

atgttagaga tggaatgaag ggtagaaaag 19860 

ctgtacttga tatataagag catactttct 1992 0 

cttataactt gtaactttct aattgtatga 19980 

tttagatata ttaactgtaa aatgaaaata 2004 0 

gagtactgtg tatataatct ttatggcatt 20100 

attgttgatg attatgatga taacaggaag 20160 

ttcttggaca ttaatgttgt tttattttct 20220 

aatcaagaat acctccttat gaaaataatt 20280 

ccagtgactc agagcttcac ccacaaactc 20340 

atccttctat catggcacca atcacaagaa 20400 

tctcctaact ataaatccta tttgtgtgtc 20460 

aggcattctt ggatatctgt aacttctatg 20520 

ggttctattt cttcttatcc acctactcca 20580 

gtataatccc tattcaatat attttttcta 20640 

aaaaatgaga cattttttcc attacagaga 20700 

aaaaagtgtc aaataaaaaa ttaccatcat 20760 

agatgggagg gttaaggctc agggatttta 20820 

cctgatatgt aaatgatgat ttatgttggc 20880 

caaaggacaa tgggaaactg taaagtagag 20940 

acatctgggt tcaaattctg aaactgtcac 21000 

ctaatctctc tgatctattt ttcctcatct 21060 

tgtcggttgc tctgagggtt aaatgaaaat 21120 



-114- 



WO 02/062946 

aaaaagaaaa tgtgaaacag caccacaggt 
tcttgctgtt gctggtgaaa gctaaggtgg 
5 gtggaaaact tacttacact tccattgtgg 
gccatctttt cgaggtggca atgtggcgtg 
ctgctaaagc acttatattt cttattcata 

10 

agaaatataa gcacttattg tgtccataaa 
gtatatcgta tgtgcaaagg ttcttttcta 

15 ttctttattt tttaatccat tctgcagcgg 
ttgaggagga attatttaat tatttaggtg 
actgagcccc aaacaaagag tccggttacc 
ctgtgcaggg ggaagcatat tacagaagtg 
tgtattacat tatttcttat tttttcaagg 

25 gcctagtgac cttgcacctg cacagctaga 
ggagagataa ggcttactag ccgcagaaaa 
ctttctcttc ttcaggggga attgggtttc 

30 

attctttaat ttcttttaat tcctgttcta 
tgtgtcacgt attgtgctaa gagcttggca 
35 cttaagttgc ttgtagtggg agatacagac 
agagtatagg aatcttgaga agcacgtagg 
tcagtgataa ttttttagaa cacttttctt 

40 

ttaacatgat ggaaggcaaa tggaagaaaa 
aaaattttga atattcaaag aataaactag 
45 atgggcagat ggaaaaataa aagtttggag 
gacagcactg tgagtttttt tttatttttt 
ttattgatca ttcttgggtg tttctcacag 

50 

gtggagggaa ggtcagcaga taaacaagtg 
gaccctgcgg ccttccgagg tgtttgtgtc 
55 gactcttaac gagcatgctg ccttcaagca 
cttaatccat tgaaccctga gtggacagag 
taaggtcata gatcaacagc atcc 

60 
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acttgacaat 


gatgttgctg 


ttgtggttgg 


21180 


gattgtggtc 


agatgtgctg 


gagagataca 


21240 


ctcctcgtat 


aacctccaac 


aactccaata 


21300 


aaaacttccc 


tcagttcctc 


gtttataaat 


21360 


aatatccaag 


aaaaataaaa 


tttatgaata 


21420 


tttataccaa 


taatttattc 


ttagttacgt 


21480 


acacttgttg 


cctacttata 


taataattat 


21540 


gataattaag 


gaaccagaga 


gactgagggg 


21600 


caccgaccca 


gtcggattaa 


catccaaagg 


21660 


ttttaagcat 


tttgtgcagc 


gggggacgat 


21720 


agaaacaaag 


acagttattc 


aattaagaca 


21780 


aaaaacatgt 


tttacgactt 


gagtttatct 


21840 


gaaatagggt 


cttcacaatg 


cctgggaaag 


21900 


acaggcagtt 


aattttaaag 


gactccagct 


21960 


tcttacatac 


aaccgagctt 


ttgcttacac 


22020 


attcaacaat 


catttaggga 


gtgcctacag 


22080 


tataatagag 


aatgatgcaa 


aatctctgct 


22140 


ccctagaaac 


acacaagggc 


tctcatagga 


22200 


acgagattct 


aacccagatt 


agaagacatg 


22260 


ctcaactgag 


ttcttaaaga 


taactgagcg 


22320 


gaggatacat 


taaaagcctt 


taagttagaa 


22380 


agactttagt 


gggtctaact 


tcagtgtata 


22440 


agaggtgtac 


atcacaagag 


attataaaat 


22500 


ttatttttta 


ttttttattt 


atttattttt 


22560 


agggggattt 


ggcagggtca 


taggacaata 


22620 


aacaaaggtc 


t- _ 4_ *- *- *_ 

cccggccccc 


c caggcagag 




cctgggtact 


tgagattagg 


gagtggtgat 


22740 


tctgtttaac 


aaagcacatc 


ttgcaccgcc 


22800 


cacatgtttc 


agagagcaca 


gggttggggg 


22860 



22884 
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<221> coding_region 
<222> (43668) . . (43736) 

5 

<400> 76 

nncaactccc tcaactggaa gttacctgat 
10 tttgtggaag gagatcacag atataagagc 
actatcaaca tgacttacca ctgttactct 
cttcttcact gtaaatttac tctttttatt 

15 

agtcactatt cacagcccag acttaagggg 
gatggcctgt gtgttctttt gccataggct 
20 ctgtgattca aacagcaaat ggagtcaatg 
ctttagaggt cctcctgagt gtcctgttgg 
atggtaagcc agtagatgag tatattcaag 

25 

ccagtagcca aataggtcta ccaattgctg 
gacagagaag ttcaaaatct tgacaatcat 
30 aattatgccc aggagtttta ttagtttcat 
tgtgttaatg gcccagctgg gttgctgcat 
ggcagtccct tcatctgagc ccaccaggaa 

35 

tttctccagg agtgagactg tgtgtaactc 
gagcttaggt ggccttgcct gtgcatgtgc 
40 gcactgcaag ggacagtgtc tggagtgaca 
gttctacaag attgccttct ttatcatgct 
aaaccaccca cgcattcgca gtataattac 

45 

gattatattt ccttgtgttt taaatatgat 
tctccatctc ttagtgtgct ggttgggttt 
50 gcattccctt tttaaaaaaa aatcttctga 
atttttagta tcatataact cacttctcag 
ttcaatttgc atatgtatat tttaaattac 

55 

taaatagtaa gtatagtgta ttcattttta 
gtataacatc ttatattttc ctttctattc 
60 tttctttttt tctattagac tgccaaaatt 
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tttttctcat 


gattagactg 


atgtttgggt 


60 


cagtatcatc 


acatcccatc 


aacagcccac 


120 


tagtcacttg 


gttgaggtag 


tgatgtcagg 


180 


cccctttaca 


ttctgtattc 


tttggaagga 


240 


tggggagtta 


cgtttatgaa 


tccaaagccc 


300 


ccactcagtg aactgagagg 


ttgctgagac 


360 


gggtccagga gaagagtggg 


gtgggtggcc 


420 


ttggggtcca 


actccatggt 


aacagtcttt 


480 


agatctacat 


caggtattgt 


aggagtgata 


540 


gggttttttt 


tgtggttttt 


ggtggggagg 


600 


ctttgcaatg agaggctttt 


ctgctttctg 


660 


ttggatatta 


ttaggtcttt 


ttatcttgtc 


720 


gtggatcaac 


agggtgtcta 


gtccttgttg 


780 


gagacaatca atatagcaga 


agggcagagc 


840 


atggcccatc 


actccatggc 


agatacctgg 


900 


tctgctgtat 


tggtggtggc 


tgttctcagg 


960 


tggctgcatg gtgcccctct 


ccctcactaa 


1020 


tacctctgaa gtctgcagtg 


aaatattatt 


1080 


aacttggtaa tgatatatgt 


tacactgtgg 


1140 


ttttcagaga 


ctgaaaaata 


tctgaaaact 


1200 


agatcaagat 


cttttaaaat 


gaatttgtga 


1260 


atattttgaa 


taaaatttcc 


attgtctatt 


1320 


ggcatggtgt 


tttatttatt 


taattattta 


1380 


ttctaatttt 


cagggtatta 


atatttattt 


1440 


tttagattac 


taatatatca 


gattttaaat 


1500 


accattattt 


aattgttctt 


atttttccaa 


1560 


ttgtttaatt 


tccttcttat 


tatttgatgg 


1620 
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aagacccaag aattgcccta tggattaaac 
aagttgtacc atatttaagt tggcttcgca 
5 aagagagaaa aggcagtatt cctggaaaac 
agaaacacac attactattt aggcagagtt 
taatgaagta ttaagtgcat ggggcttttg 

10 

cacatcagtt agctgtgatc attgttggaa 
tagggcagac aaaggagtga cctgtagtat 

15 cactttcctc ttcccagcaa tctgtgactt 
tgactccgtc cctgcctcca ctgtgctctg 
ccccctttcc actgcaatga ttgaatcagc 
gaatgattct caatcgtctt gttttgaaac 
aaactacagg gtgatatatg gatgtaaggc 

25 ggtgggaaat cagcctaagc atgaagccaa 
cacaaaggta cacaagagtt ggggctggag 
agctccaatc ttatctgaag ttcacctcaa 

30 

ttttttgttt tgactttctc tatatgataa 
gcatttagaa acacaaaaaa tgatttgttt 
35 cagattgtaa tttagataag taaacaatca 
ggatgaaaat tgaggagaga aaaacaaaga 
taggttccca cttcaaacag cttatccagt 

40 

acccccfctgc ctttgctgca ccccaacttg 
aaccccagtc cacattcctt tccttattag 
45 aatgacccct cgtctctctc ttctcacctt 
taaatgttta gccagtcacg actagttaga 
aacacacaga agcagaagct gcattagaga 

50 

gtgctctcgc cattgctcca tatgcgagac 
ctgagagacc ctttgtcctt tgtttcagcg 
55 tttccaacaa tggggacagg aatggaaaaa 
ttctcttgac taatatatga aaaacactgg 
cccaccagga atttaggctc cttatgggta 

60 

aaaccgatac taaattgaca aatcttgatg 
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tgagttggag 


agaggcaata 


aaaatctttc 


3300 


caccaccaag 


taaaaaagat 


aagatcccct 


3360 


ctcagacaaa 


cagtttttct 


aagctgtagg 


3420 


aatacctcat 


a ggctgtggc 


attcaatget 


3480 
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actattttgg 


accttacggt 


gcaaaagaaa 


tacattaata 


cactgcctaa 


cattcatatc 


3540 


5 


aagtaattac 


actgtctact 


attgctacat 


tgtcttattt 


tgctcttgtt 


ccaaattcac 


3600 




ataattttcc 


ctgttgttat 


tactctcttt 


tcatgaatga 


agaggttttt 


gaaaacaggt 


3660 




ttgctacagc 


aacaattctc 


tgcttcacac 


aggagggact 


cctaagtgcc 


agcctctgta 


3720 


10 


tatgtgaaca 


gggctgccat 


tttgttctca 


ccagtctaaa 


gttatcctag 


gttcctattc 


3780 




caaatgagct 


accactcaca 


gttattttat 


tcagccaata 


tttattaagt 


atctggccca 


3840 


15 


tgcagaccta 


tgcaagagat 


cagtaatacc 


tgtagaagat 


aatgtccctg 


cccttgaaca 


3900 


cagtacatag 


ggagcatcac 


gagagcacaa 


aaagttcaga 


tgtgttggct 


aaaatttccc 


3960 




ttgggaagag 


gaggattcat 


tgaaaattaa 


cattcaaata 


atgctttcaa 


agaagagcag 


4020 


20 


gtgcttgcag 


ccatatgcat 


tctaggtagg 


ggaatcagtg 


cgcacagagg 


ctagagccct 


4080 




gaaacagttt 


ggcttactgg 


ggggaactgc 


aaacactttg 


ttatgattct 


gtcacagtgg 


4140 


25 


gagaagtgac 


agtttcattt 


gggtgagagg 


taagatcctg 


gacgtgcaaa 


atgtggaatt 


4200 


cacaggactt 


caaatgctta 


ctggagacat 


tgggctttgt 


catgcagtca 


atgtgcagac 


4260 




acagaaagat 


tcaagcagca 


cagtgacaaa 


atcccatttc 


tattccaaaa 


gttcctatta 


4320 


30 


cagtagtgaa 


tgtaagggta 


cggaaaggca 


aatgaaggtc 


agaaaatcca 


aaacaataag 


4380 




acatgggata 


gaatattaat 


attaggcaat 


aaaagttcta 


aggaaaattc 


tcaggtatga 


4440 


35 


caaaggaggg 


catttataat 


gatatgttgt 


ttagaatctt 


atgatcaaat 


aacatataat 


4500 


attctactaa 


taaaaaaatg 


cctaggagtg 


tgagaaacta 


acaaaaacac 


agtactcatg 


4560 




agaattttta 


acatatgtaa 


gttttgagga 


atgtaagcca 


aaaaagatga 


tttgaattac 


4620 


40 


aaaatacttg 


aatgaatcaa 


cactaaagaa 


agcatgcttt 


ttaagaaccc 


ttggtacatt 


4680 




ggcaacaatt 


gaccaaatat 


tcatcccaaa 


acctcactaa 


cttaaaacaa 


aaaatccagc 


4740 


45 


ccacattaac 


taatcaataa 


agctagaggt 


aagtaataca 


aatatttgtt 


accaaaataa 


4800 


gattaatttc 


aagcccaatg 


aatatttata 


gacttttaat 


tgttaataac 


cagattaaat 


4860 




gcctaaagtg 


aatatatttt 


ggtcatacta 


aaacaaaaat 


ataaagtgtt 


cagtaattca 


4920 


50 


cttaagtgga 


aacatgaaga 


atttttatga 


ggaaagacat 


acgcctttta 


tttagaaaca 


4.980 




tagacagaga 


cttgactgta 


ttgacttatt 


tctggttgtc 


agttattcca 


catttttttc 


5040 


55 


ggacgtaata 


taatatactt 


tcaaataaat 


ctcttgtgga 


aattgactta 


atgttcttta 


5100 


tgtccatgtg 


gaatgagtat 


gactagccaa 


aagtatttta 


ataaagacta 


atgttgaagt 


5160 




aaagacagta 


ctatcagaaa 


ttaaaatgga 


gtatacaata 


actaatacaa 


gtgatgcgtt 


5220 


60 


gtgttaaaaa 


agactgaatg 


gcaagaaaat 


gagaatagaa 


tgcccacata 


gtaatcatag 


5280 
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aaaaaatata aatggatgaa gtgcaggacc 
catgcactta tgcatgaaaa aatggatttg 
5 gaaaagtgat gtgttttgaa atgcaaaaag 
tgagagcacc acaagaggaa cagttaagat 
acaggacatg caacattgca cttttcacag 
cccaggtact ttagttcagt aagactatgt 
gtggtgaaac tcaagccctg ccgtcttctt 
15 tgcccatgag gcaatttcat ggcatggcat 
gtcaagtgag gaagactgaa gccgaagact 
ttgtccttac ttgtatttta aaaacccagg 

20 

atatctgaca tttggaacct tgtattatta 
gcacaggctc gggactgatc ctgtaacaaa 
25 tctcagccaa atggctcagg gttgacccta 
gctgcctacc atgccctgaa taaccagtta 
cagggggtgt taatgtcatc ctttgtggag 

30 

ttttcatcaa gaatttagtt gctgatccta 
atttataaaa cgagagtgtc cagattgaag 
35 tggaatacag atcaaattta atcaggtaaa 
aatcacctgc acaaacctga gatgggacta 
aggcctgggc ccttctacta gttagctgaa 

40 

agaagtctgc ctttgcctct agagcaatgg 
ttgggaaact ttaacacgta ctaacacatg 
45 aatatggatt atggttggag agtcaaaatt 
tagtcaatgt tgccaattac tgctataagc 
aatctggagg gcttcttaaa acacagaatg 

50 

ggggtagggt atttgcatgt caaccacaat 
aaccacattt taagagctac tgctctagac 
55 gctttttaaa ttctaccgct ggtattgagc 
tcttctcttc tgagtaggat ttattccata 
ctgtgtctgg agggtagggc caagttggac 

60 

taaggacaga gagtaggatg tgatgtcttg 
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agacaattta 


actaaagaaa 


tagaaatggc 


5340 


aaagcaaata 


atttgtataa 


tgaaataaac 


5400 


aaaaaaaaag 


tggttggtgg 


agaaaa'getc 


5460 


atgaaagtcc 


ctggaaaaga 


ggaagctggg 


5520 


gatcagcacc 


aagagcttgg 


agttcaagtt 


5580 


gatagctgtt 


atgeggtcat 


ctgggggctg 


5640 


attgtgttta 


gtcggaatgt 


acaacaatta 


5700 


tggtatattt 


gcttcagggc 


tagagtgaca 


5760 


gaacatcaga 


gttatacatc 


ttgacatcct 


5820 


cccagataac 


tccaacactc 


ggtttctcct 


5880 


gttcagaaga 


agtactgaag 


aggectcagt 


5940 


ggtgatcaga 


tgacacccag 


gcccagctgg 


6000 


aatatggaaa 


accagagaat 


caggactatg 


6060 


tatggattgt 


catctggtag 


aaatcctatg 


6120 


tctcttggtt 


ggatcttcaa 


ccacttaata 


6180 


tttgcaaatg 


atccaaaaac 


aagaagtata 


6240 


atacaaccca 


aaagtctgga 


acatgagegt 


6300 


attaaagtcc 


caagcttatg 


tttaaaaaca 


6360 


gaacttgctt 


aacagaagtt 


aatgtgagaa 


6420 


aggatggaga 


cagaagcaaa 


agactctcca 


6480 


atctcaatct 


tgtctgagca 


ttgggataac 


6540 


agccccatta 


ccaggtattc 


tgatttaatt 


6600 


ttttgtagct 


etttagegga 


ttctaccgta 


6660 


ctgtagatat 


caaacttgag 


tgtgtatcaa 


6720 


cacaccgaga 


ctgtctgatt 


caataggtct 


6780 


gtcaggtgat 


actgatgeta 


ctactctggg 


6840 


gaacctggag 


ttctccagat 


gcaactaaag 


6900 


cag a a eg cat, 


agao.ee tygc 


CCCytyCCta 


COCA 


agetttggee 


ttctctttac 


atcttcatat 


7020 


ttegggatae 


taaactgagg 


ccttttattg 


7080 


atgetgette 


aggttactag 


aggagacaat 


7140 
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gatctcagtt cagaaccaag cttaaagtta 
atgcaagcta ggatttactt ttcaatcttt 

5 

tttgggaaga aattagggta aatgagcaat 
tctcctagca ctcatggtat ggcctgttca 
10 tgcacaattt tcaggagtcc agaaatacaa 
caattgttgt catctctttc cacacacaca 
caggcagttc ttacaatatt gagatgcttt 

15 

atttacatct tataaaaata gaacatcaga 
tgaaatatga gaagtctgtg gcccagggaa 
20 atgaatgtat aacccagttt ttctgttttt 
tgtcatattt aattgagcca caagagggag 
tgaattccta cctaatctgg tgtcttctgg 

25 

ctggtcttgt atactagcgc tgggttgggt 
tagtagatac agcaagtcat ccctctgacc 
30 gtattattta gatattatta cttatattga 
ttttcaaaac atcacaagtt gattttcaag 
tgctttttgt accatcctaa ctttatacca 

35 

cactctggtg aactgaaaaa catattttag 
cctctccctt ctcctgtctt tccccacatt 
40 gagggtgtgg atacagctag agggagaggt 
tctgaaaaat ctgggttata aatggcacac 
ttttggaaca tggatcttat tgcagaaggt 

45 

ccaggtggca acatctgtgc ctggcttctt 
cataagggaa ataacctgaa ccagcccgac 
50 atagatcaca cattaaggaa atatgtccat 
atggatctag aagtgacttt gaaagtgaat 
actgatggaa atgaaccaag catttatttt 

55 

gggtggagag aggattaaaa gaagtgaaat 
acccattccc atcatggttg ggccagtacc 
60 ccctttattt tacacagtat agaaattcac 
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tatatctctc 


ggggactatt 


tgtcccagga 


7200 


ccccagtttc 


tttcccaaaa 


gtaacatatc 


7260 


ggtaagctca 


aatcctggca 


tcaggaaaat 


7320 


ccactgacca 


caaagacatg 


tagaaggtgg 


7380 


acagttctct 


taatacttgc 


tctgtagtct 


7440 


tgcatgcaca 


cacacgcaca 


cacatgcaca 


7500 


gcttagactg 


ggctgtatat 


aagagacctg 


7560 


gttagaagag 


acgtggaaat 


ctttctacac 


7620 


agaaagtgat 


tcacgtgagt 


catgaaggta 


7680 


cctattatcc 


ccttcatttt 


gacattgagg 


7740 


gcagtggaca 


agaaattgtc 


ctacctaatt 


7800 


gaagacctta 


tgaaaagtca 


actcccagct 


7860 


cccatttcta 


tcccatattc 


tctgggaatc 


7920 


cttagctttt 


tcatctgcta 


aattttgtag 


7980 


aaggtgttag 


tgaggatacc 


tgagaatgaa 


8040 


atagtattaa 


agtgctttaa 


tttcaaaggg 


8100 


cttttttgtc 


cactcccacc 


ccattctctg 


8160 


attttatcta 


attgttattt 


actctactct 


8220 


ctgtaaaatg 


tgagcacatg 


catgatggag 


8280 


tgagactgca 


catctcacct 


ggtctcatat 


8340 


cagagacatt 


accatgccct 


cctctctgct 


8400 


gagggtttgg 


gaaggcttcg 


ggatgagtgg 


8460 


cccttcttgg 


tggaacagcc 


actggcctct 


8520 


cctgagagcc 


tccctcatgg 


tgttggactt 


8580 


gagagatttt 


aggatgactc 


atatggtttc 


8640 


gaaccaacct 


agtagtttct 


agtgaagctg 


8700 


ccaatgccat 


agagtggatc 


ggtgaaagct 


8760 


aaagttagga 


cttctgaacg 


tttctctaat 


8820 


atttttggag 


tttttgtttt 


gtttctcttt 


8880 


tgactactcc 


tcagattttt 


ccagtcctgc 


8940 
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ctccccagag ggactcacat cccttttcat 
ctcctaggat tcccactcgt tactcaaaat 
5 ttaggtgaat aaaatttaat ctaataatat 
tgagataatg agtggaacca cgcatttatt 
agttatctgg accaacaagt ggtaagagca 

10 

ctgaggaaac agtacgtctg gagacacatt 
gttgtctgag atttctgggg ggctggatta 
15 ttttgagtga ggtaggaggc attggaagat 
gcatcttaac aggaccactc tggctgcctg 
gggaagatgt ggggagagta gcgaggaggc 

20 

ggttgtgttg ttggaacaca gcatcactgc 
gcctctcttt ctttcactcc ccacattcag 
25 tgtccccaag aaacataatt catcccatgt 
cgcagaattt gggcagagtt aagggaacga 
aaagcaggaa ggtatgagaa gagaacacac 

30 

ttgtaagaag agctgctttg tagtataact 
gtcaatgcag agagaatagg aaagcaatgc 
35 tctccaaaca aacctgaatg gtttgcttta 
attttcccag agtgacacaa ttggcaaatg 
tttttagccc tgtggtcttt ctattacact 

40 

aatgagactt ggctttggct cacagagaca 
gtgagaagat atattgtcgg ctcattccct 
45 gcattttttt tttttggaat gaaatatatt 
cctgatgttt tctgtagctt tcttttttta 
atagttttta aataaataat ctgtacacca 

50 

gtgtaggtac caacctccca ggattttagt 
cccaccatcc tctctcttgg actactgtcg 
55 cctcatctcc cttcagtctg ttcttatcat 
actagtcaga tcctgtgttc cttttgctga 
ataaacctaa gtccttaata agacctcaag 

60 

tcagacaaca ttgccttctc ttctccctat 
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ggcagtaacc 


acctggtgtt 


gctcgagcac 


9000 


ggataataaa 


gaaattttct 


caaacgcaat 


9060 


gtgtgaacta 


ttcttctgaa 


ataataatgc 


9120 


ttcttgaacc 


tgataataca 


aatccctagc 


9180 


gaggtcttga 


ggtgagcacg 


ttccatgctt 


9240 


gagactggaa 


tagggagagg 


agaaggcaat 


9300 


taaagcgact 


tattaaggag 


atacggctta 


9360 


tttcagcaaa 


gggaagacag 


gatctaactt 


9420 


tcttggtaag 


aaccgactga 


agggaggcga 


9480 


tccggcagta 


gtcccagaga 


gaggatggtg 


9540 


tcattcaaat 


cctaggaggt 


gggtgcctac 


9600 


tcattttgaa 


ggctcaggta 


tcagtaaaag 


9660 


gacttattta 


agggattttt 


atgaagacct 


9720 


agaaggaaag 


ggaggtcact 


acagacaaga 


9780 


tattatgggg 


accaataaag 


agcagaagtc 


9840 


gtggaagaag 


tagctacttc 


caaaggctgt 


9900 


tttgtccttt 


ctctctttcc 


agcctctaat 


9960 


ctgtcacaca 


tttatagggg 


atttgtggag 


10020 


gctgaattgt 


ccggagctca 


gacctgctgc 


10080 


acctggtccc 


cctccagaga 


gaaacaacta 


10140 


atgaataggg 


aaagttggaa 


gtctaagtgg 


10200 


tttttgccct 


cccaagagaa 


aaaagagctg 


10260 


ttctttttct 


gatagtgttc 


cactgctctt 


10320 


aaacttttat 


tttatgttca 


gggtacatgt 


10380 


aacccttgtg 


acatgcagtt 


tgaaagacaa 


10440 


gagaagcaac 


gctgcatccc 


aacaacccaa 


10500 


gagcctcctc 


actatcctcc 


ctacgtcttc 


10560 


ggcaagcagc 


cagagtgata 


c cgtcaagac 


lUoz U 


aaatcttcca 


gtgcctctat 


ttcactcaga 


10680 


ttcttttata 


attgaggccc 


cgcagacacc 


10740 


cccaatctca 


atgtgtctcc 


agacacactg 


10800 
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tcccctcaga 


agcacaggaa 


cctgctcccc 


tcaagagctt 


tgcactttcc 


acttctttgg 


10860 


5 


ttcaaataac 


tgctagtgag 


ctctaaggat 


ttgcattatg 


aggttcaaga 


aaataaatgc 


10920 


ttacatgcca 


cagaggtttg 


gtgtaccgat 


tattttatca 


cccatgtaat 


aagcatagta 


10980 




ctcaatagat 


agttttttaa 


tccttaccct 


cttctcaccc 


tctaccctcc 


agtaggaccc 


11040 


10 


agtgtctgtt 


tttcccttct 


ttgtgtccat 


gtgtactcag 


tgtttagcta 


ttactcataa 


11100 




gtgagaatgt 


gtgatattta 


gttttctgtt 


tctgcatgag 


tgtgcttagg 


ataaaggaat 


11160 


15 


ccagctccat 


ccgtgtttct 


tcaaagaaca 


tgaactcatt 


ctttcttatg 


gctgtgtagt 


11220 


ttcccatggt 


gtatacgtac 


cgttttttcc 


agtctaccat 


tgatgggcat 


tcagtatgat 


11280 




ttcatgtctc 


tgctatgaac 


atatgtggac 


atgtatcttt 


atggtaaaat 


aatttatgtt 


11340 


20 


cctttgggta 


tatacctaat 


aatgggatag 


ctgggtcgaa 


tggaacttct 


gttttaagtt 


11400 




ctttgagaaa 


ttgccacact 


actttgcaca 


aggctgaact 


aatttacatt 


cccacaagca 


11460 


25 


gtgtatggtg 


ttatttttcc 


tccacaacct 


ttttagcatc 


tgttactttt 


tgacatttta 


11520 


ataatagcca 


ctctgactgg 


catgagatgg 


ccacagtctc 


cttgtagttt 


tgatttgtga 


11580 




ttctctaatg 


attagtgatg 


ttgagagttt 


tttaagtata 


cttgttggct 


gtgtgtatgt 


11640 


30 


cttcttttaa 


aaagtgtctg 


ttcatgtcct 


ttgcccacct 


tttaatgggg 


ttgattgttt 


11700 




ttcacttgtt 


aatttaattt 


ccttatagat 


tcttgatatt 


agacctttgt 


cagatgcata 


11760 


35 


gtttgcacat 


actttctccc 


attctgtagg 


ctgtctgttt 


actctgttga 


tagtttcttt 


11820 


tgctgtggag 


aagctcttta 


attaggtccc 


acttgtccat 


tttttttttt 


ttttgtgcaa 


11880 




ttgcttttga 


catctttgac 


atgaaatctt 


ttccagggcc 


tatgtccaga 


atggtatttc 


11940 


40 


ctaggttatt 


tttcatggtt 


tttataggct 


tagattttac 


atttaaatct 


ttaatccatc 


12000 




attagttgat 


ttttgtatat 


agtgtaagtt 


agggatccag 


cttcaatctt 


ctgtagatgg 


12060 


45 


ctagccagtt 


atcccattac 


catttattaa 


atagggatcc 


tttccacatt 


gcttgttttt 


12120 


gctgactttg 


ttaagtatca 


gatggttgta 


ggcatgtcac 


tttctttctg 


ggatctttat 


12180 




tctgttccat 


tggtcaatgt 


gtctgttttt 


gtaccattac 


catgctgttt 


tggttacagt 


12240 


50 


agctttgtaa 


tatagtttga 


agtcaggtac 


catgatgcct 


ccagctttgt 


tctttttgct 


12300 




taggatttcc 


atggctattc 


aggctctttt 


cagttccata 


tgaattataa 


aatagtttat 


12360 


55 


tttctaattc 


tgtgaagaat 


gtcattggta 


gtttgatagg 


aacagcattg 


aatctgtaga 


12420 


ttgctgttag 


cacaatgact 


attttaataa 


tattgattct 


tcctatccat 


gagcatggaa 


12480 




tgctttttta 


tttgtctgtg 


tcatctctga 


ttactttgaa 


cactgttttc 


taatcctcat 


12540 


60 


tgtagagatc 


tttcacctct 


ctcattaact 


aaattcctag 


atattttatt 


cttttttttt 


12600 
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ttttggccta ttgtgaatga tttggctctc 
tactgattta tatacattaa ttttatatcc 
5 taagaggttt tgggcagact atggggtttt 
gagatagttt gacttcttct tttcctattt 
ttgctttgtt tagaacttcc agtactatga 

10 

ttgtttcagt catccaggag aactttccag 
tggtttgcca tagatggctc ttattatttt 
15 agtttgttga gggtttctta acatgaagac 
atctattgag atgatcatgt ttttagttct 
accttacgtt aaaccaacct tacatccaag 

20 

gattttttat gtgctgctgg atttggtttg 
ctgttcatca aggatagtgg cctaaagttt 
25 gtgtgtgtgt ctgccagatt ttggtatcag 
gaagatcctt tcctcctgaa tttttggaat 
tatacatttg gtagaattgg gctatgaatg 

30 

ggctttttgt tgtgatttaa tttcagaact 
tcactggttt actcttggga ggttgtatgt 
35 ttctagcttg tgttcataga ggtgttcata 
gggattagtg ataatgttgc cattttcatt 
ttttctttat tagtctagct actggtctat 

40 

tcctggattt tttaatcttt ttatggtttt 
ggtactggtt acttcttgtc ttctgctagc 
45 gtcctctgtg tgtgatgtta ggttattcat 
atttagcact atacattttc ctcttaatgc 
tgttatatct ttgctctcat tagtttcaaa 

50 

gtttacccaa cagtcattca ggaacagatt 
aaagatattc ttggtattta tttcagtttt 
55 tatgattttt tttaactgct gaggattgtg 
tatgtaccat gtgcacatga gaagaatgtt 
tagtagtctg ctaaacccat ttggtgaaat 

60 

gttttctgcc tctatgatct gtctaatact 
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agcttggatg 


ctgttggtgt 


ataaaaatgc 


12660 


tgaaactttg 


ctgaagttgt 


ctatcagatg 


12720 


ccaggtatag 


aattatatca 


tctgcaatca 


12780 


ggatgcattt 


tagttctttc 


tcttgcctga 


12840 


aaaggagtgg 


tgagagtggg 


catccttgtc 


12900 


tttttgccca 


ttcaatatgg 


tgttggctgt 


12960 


attattttga 


ggtatattcc 


ttcaatgcct 


13020 


atgtagaatt 


ttatcaaaag 


ccctttccgc 


13080 


gtttacgtgg 


tgaattacat 


ttattgactc 


13140 


agataaagcc 


tacttgatta 


tggtggatta 


13200 


ctagtatttt 


ttttgagtgt 


ttttgaacct 


13260 


tcgtgtgtgt 


gtgtgtgtgt 


gtgtgtgtgt 


13320 


aatgatgttg 


gcctcataga 


atgagttaga 


13380 


agtttcagta 


ggacggtgtc 


aactcttctt 


13440 


catctggtcc 


tgagcttttt 


ttagttggta 


13500 


tgttattggt 


ttgttcaagg 


tttcagtttc 


13560 


ttccaggaat 


ttatctattt 


cttctatgtt 


13620 


gttgtctctg 


aacatttttt 


gtatttctgt 


13680 


tctgactgtg 


tttatgtggg 


ttttctcttt 


13740 


caatcttatt 


tattctttcg 


aggaatctcc 


13800 


tcacatttca 


atattgttta 


gtttagctct 


13860 


tttggggttt 


gttagctctt 


gtttctgtag 


13920 


ttgagatctt 


tctaactttt 


tgatttgggc 


13980 


tgctctagct 


atgtcccaga 


gattctcgta 


14040 


gaatttcttg 


atttctgact 


gaatttcatt 


14100 


gtttaatttc 


catttaacta 


tgtagttttg 


14160 


tattgcactg 


tggtctgaca 


gtgtgtttgg 


14220 


ttatgcccgg 


ttgcatggtt 


gattttagag 


14280 


tactctgctg 


tttgggggtg 


gagagttctg 


14340 


gttgagttca 


gatcctgaac 


atctttatta 


14400 


accaatgggg 


tgtcaaaatc 


tcctgctatt 


14460 
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atcatgtggt tatctaaatc tcttcatatg 
gctctggggt ttgatgcata tgtatttatg 

5 

ttaccattat ataatgccct tctttatctt 
gaaattagaa tagcaactgc ttcatttttc 
10 atccctttac tttaagttga tgggtgtcat 
atatagttgg gtcttgcctc tttatccaat 
ttatcccatt catattcaag gttaatattg 

15 

tgtcagttgg ttattatgca gacttgaatt 
aacagagctt cccagcttac caggagcaga 
20 atgaactgct ggatgttcaa tgtggactaa 
cctggggaga ggtcaaacac ttgcatggct 
gtgtaaagaa cagagaaaaa tcttgtgctt 

25 

taaatactcc agggcattct gttgtcttta 
ttttttcaga ttctgattga ttctgattga 
30 ggaaaaggtc aatacacaac tccaccttcc 
tcagctacat ctctctctcg tatcctgcct 
tttgtgaagt tgataactga gaagcacaag 

35 

ctagaggatg tttcccctca ctctacacta 
acaggagatt atagtgaatg aactgtaaac 
40 ggagatccaa taataatagg ggagaccaaa 
gacccaaata gctacaacaa acaataaatg 
accttacata aaaggcctat ttactttact 

45 

ttacaatgta tgttaaaagg taaaagacaa 
aagattcaaa tgcggcagag attttggaat 
50 ttaatattat aaggacttta acaggaaaag 
taagcagaga gaagaaattt ctaggaaaga 
cagtaataaa aattttaaaa atgtttttga 

55 

ggaaagagtc agtgagcttg aagatacgtc 
ggaaaaaaaa tggaaaaaaa aagaacagaa 
60 gaataacatt acacataatg gaaataccat 
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tatctaagaa 


tttcttttgt 


gaatctgggt 


14520 


acagttaggt 


cttcttgttg 


aattgaacct 


14580 


ttaaaatcat 


tgttaaaatc 


tgtttggttt 


14640 


tgttttcttt 


tgcttggtag 


atttttctcc 


14700 


ttcttgtgaa 


atggatcttc 


tgtggacaac 


14760 


ttgccacttt 


gtgcctttta 


attggtgcat 


14820 


atatgtgcag 


gtttgatctt 


gacatcatgt 


14880 


tttgtcccga 


aatttggaca 


ggaaggccca 


14940 


agcacaatag 


gaacacatca 


attataaccg 


15000 


cttgggagct 


gaaactccat 


gaggcccagt 


15060 


tttacctcaa 


agtgtcctag 


caggtattta 


15120 


ctgattaagg 


aaggtaatat 


aaccattttt 


15180 


acaaaaaaag 


tcttccctca 


agggaaacta 


15240 


tttggataat 


accaaaacct 


aaccagtctt 


15300 


tctatccttc 


cacgtgggga 


aactgaatat 


15360 


gacttaagtt 


caggtagggg 


ctaagaagca 


15420 


ttgctaaaag 


gctgagacct 


aattatgata 


15480 


ccatcagatg 


aatcaggatc 


ctgtatgata 


15540 


ctcagaccct 


atttaagaag 


tggtgtctag 


15600 


acaaggacat 


tacaggacat 


ttcagccttt 


15660 


cagcctaact 


tctagccaga 


taaacataat 


15720 


tccttttatg 


tccagctatc 


aaaaaaaaaa 


15780 


agtgtgaaga 


aacaaagcaa 


gcaacagaac 


15840 


tatcggatca 


gaaatgtaaa 


ataactatga 


15900 


catacaacat 


acatgaataa 


acagatatga 


15960 


atgaaaaggt 


aatgctagaa 


atgtaaaata 


16020 


taatttcatc 


agtatactag 


gcatgtcaga 


16080 


aataaaaact 


tccgaaactg 


aagagcaaag 


16140 


caaccccaaa 


gtgcaggaca 


actacaaaaa 


16200 


aaaaagaaac 


aagaaagaaa 


ggaagagaag 


16260 
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aaatatttga agtttaaata atgactgaga 
cacaaatcca ggaagctcag aaaataccat 
5 ggtatataat attcaaaaac agaaaatcaa 
aggagaaaag gaaacttaac tagagaggaa 
cagaaactat gcaaacaaga agggggtgaa 

10 

gttcaggcac cagtcacact tccttgttac 
cagccctgga gttctgccta tgtgtgttgc 
15 ccattttaag agtgccagca tgttccatag 
tgaattaata tcctcatatt ggaagaaaaa 
acatggtagg cagtatttat ggcaagggtg 

20 

tgagataatc agatgggaag aacgaaccag 
gtccttagaa cgtacctatt ctaactctga 
25 cattgagttt agtttctgag ttgagattaa 
ccttactgtc ttcagccact ttctttctca 
atgtgaggag acttcatgta ttctctgtgt 

30 

cgtgctccct cttagctctt aatcttatgg 
aagagccata atcttgttga atcaaagtcc 
35 tttaattcat ccactgtttc attaaagcaa 
tttgcttgga tttctgaata tagtacaggt 
taattcaaag gagagactgt aaaaaatata 

40 

actggcattt ttttagcttt ttgtggaatt 
aatccggtgt attagaaaca tgactttagg 
45 tacttgccag agatagttgg tgattattat 
gagattagac atctctataa ttgtgggcct 
aaagcttttg tttcctttga taacttatat 

50 

gtaagtcatt ttctaagagt ttcagattaa 
tgcagagcag taaattggtc agagctaaga 
55 tgatttgtat acactacttt ggtgtcagca 
accctgtcct taggggacac tggcatttat 
tctgaggtta agtgttaaga attttaagag 

60 

tacatatgta ctatttgaag aaccatttct 
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tttttccaaa 


attagtaaca 


gatgccaaat 


16320 


gcatagtaaa 


gagtaaaaca 


tctatacata 


16380 


agacagagaa 


aattttgaaa 


gaagcaataa 


16440 


caaggataag 


tattacatta 


gtcttctttt 


16500 


atgaagtatt 


taaaatgttg 


gaggttaaag 


16560 


agtagacact 


ggttgagatg 


ttaggtcttt 


16620 


tatgttgtac 


acaggtcaac 


tcaaactctt 


16680 


agctcagtat 


atctcaaggc 


ctgggatttg 


16740 


tactcatgaa 


atgtggtggt 


gatgagatac 


16800 


gtgcttaaaa 


aagctggttc 


atattggcaa 


16860 


gtgtagaatc 


actggagatc 


aggccggctg 


16920 


ggatgaagaa 


acaaatatta 


ctctacatcg 


16980 


aacttggatc 


atctcactcc 


cagcgatgta 


17040 


gacattcttc 


acagtaaata 


tttcttgcac 


17100 


cctggttttc 


ccttagtatc 


attgatcatc 


17160 


ttttaatact 


tttttcctag 


aaaccataac 


17220 


gttggcctat 


caaaatgttt 


ataacatagt 


17280 


attataatga 


gtgactgctg 


gagaatagta 


17340 


agcctggaat 


agctaaggaa 


tctgtgttca 


17400 


tcaaaacact 


aaagctggtt 


ttgaataact 


17460 


tattttaact 


gcctcaaatt 


cacaagataa 


17520 


acacttgatg 


aaatgaaaat 


gttgggaaat 


17580 


cctctgtgaa 


actcaatgtt 


caagggaagt 


17640 


ttagaaggca 


tgtgatgaaa 


taaaaaaaag 


17700 


atttgttcaa 


tttgactgac 


ttgattggct 


17760 


aacttaattt 


cattagttgg 


caaaaatggt 


17820 


gatcttgtaa 


tgataaagaa 


gagaagaatg 


17880 
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tttgcatatt 


aatatacata 


ggcaccagta 


18000 


tgttttctta 


agaataatac 


ctttctttgc 


18060 


taagcacaaa 


ggaaacccat 


atgggaaaaa 


18120 
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aaaacatttt gtgcttttaa tttagaatta 
aggatacaaa ataatatttt ttctagattc 

5 

cacctcaact ttacatgact cctacttcct 
tctaatcttt tattccagat atctctaagg 
10 atataatttt ctctgatact gtatctcagt 
attctctgca ttgtttctat tttacccacc 
ggtgtcagac attgagaaat gttgaaatca 

15 

cagagataca atgaactggt aagaaaaaca 
taatcttggt ggattaaaca catctatttc 
20 ggtctgggag aaagtcataa atccaaaaat 
ttttttaaat gttagaaacc agctagacaa 
tgaaaacctg actgacggaa ttggaagctt 

25 

cagtaagtct caagaattat aagtgctttc 
gtctgagaag gtagttgtat tagtatgttc 
30 tgggtaattt acaaagaaaa agaggtttaa 
cctcagaatc atggtggaag gcaaaatgca 
gagagccaag caaaagggaa aaccacttat 

35 

ttaccacgag aaaagtatgg gggaacacgc 
cctcccacaa catgtgggaa tgatggtagc 
40 ctgccaaacc atatcaatag tgaatactta 
aagatccctg aaggcctaga gtataaccag 
caggaaggga gagtgtatac agaaaaatac 

45 

atagacactg agatttacat ttatggtgga 
agaaaacagc atatgaaaca tgaggcagtt 
50 gaagcgtaat cataacactc ataatgaatt 
agttaactaa aaaatcgtga taaaactata 
gtgggaaggt gtgtccaggg taaaaatgaa 

55 

agtaggaggt tagtgcataa tatataaaac 
acttagaaat atggagtcaa cacaaaaaga 
60 agaaagaaaa ttgattggga actggggaag 
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tataggcact 


ggaaaaatat 


ttttattaaa 


18180 


aaaatgagga 


attacagtgc 


ttagatctat 


18240 


aatatatttg 


ttttcacatt 


ttccctctgt 


18300 


cttttaccaa 


tctctttagc 


tgagattgta 


18360 


tcactgctga 


ctgatgtgcc 


acctacatcc 


18420 


caatagcatt 


caacagaaat 


gttcctacct 


18480 


aaaagataaa 


taagatacat 


tatctgtcct 


18540 


cttataaaaa 


tataattttc 


aaataatggc 


18600 


ctcatgaaac 


ccaactaaat 


gactataaag 


18660 


tcaaaaataa 


tatgagagaa 


attggatcat 


18720 


gcaataaatg 


acttagcagc 


ctccggaaag 


18780 


agaaaaaatt 


tgattctctt 


tgtagaaccc 


18840 


agttgaatat 


gtctggggac 


acagctaatt 


18900 


tcacattgct 


aaaaaagaca 


taaccaagac 


18960 


tggactcaca 


gttccacatg 


gttggggagg 


19020 


catcttacat 


ggcggcagac 


aagagagatt 


19080 


aaaatcatca 


gatctggtga 


gacttattca 


19140 


ccccatgatt 


caattatctc 


ccagtggttc 


19200 


tacaattcaa 


gatgaaattt 


ggtggggaca 


19260 


agtaagtgaa 


ttccctgaga 


atagtgaatt 


19320 


tcctaactgg 


aataggagaa 


tataaggcgc 


19380 


taatgaaaca 


gatcaattac 


ctgtgggtct 


19440 


gtatttaaaa 


caacttattt 


tcaggtatgt 


19500 


actaacttca 


gagagaaaaa 


tggtcaaaag 


19560 


agccattaat 


gatttaaata 


ctataaattg 


19620 


tgggagaatg 


agggaaaagg 


aagtgttatg 


19680 


ggatataggg 


gttagtctat 


tgtcttcagt 


19740 


caaaaattag 


aacagtattt 


aaaatcattc 


19800 


aacaattaaa 


atttaaatac 


ttgcttctgg 


19860 


ggggcactaa 


tgatttcttt 


caggctaaat 


19920 
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ccagtgaaca 


ttttactgtt 


gttagcttgt 


ctctctgaag 


tatttaataa 


taattgaatg 


19980 




atttcttcct 


actgaaatat 


ttctgtgctc 


aaaaaatgat 


aagtaattta 


tttatattac 


20040 


5 


ctaatataac 


cctttcaaca 


aegctatgae 


atattactat 


ctatatttta 


cagatgagta 


20100 




aactgaggtg 


cagaaagect 


aagcaacttg 


ctccagctca 


catagggcat 


aggattcaaa 


20160 


10 


atccagctca 


aggcegggtg 


cagtggctca 


cgcctgtaat 


cccagcactt 


tgggaggctg 


20220 


aggagggtat 


attacttgag 


atctggagtt 


cgagaacagc 


ctggccaaaa 


tggtgaaacc 


20280 




cagtctctgt 


taatatataa 


tataacatat 


tatatagaga 


tacattctat 


tatatatata 


20340 


15 


atatattcta 


ttataatata 


atatatattc 


tattatataa 


tatatattat 


atattatatc 


20400 




tattatatat 


aatatatctg 


tatatgatat 


ataattatat 


ataatatata 


gatattctat 


20460 


20 


tatatataat 


aggatatata 


ttctatatat 


atagaatata 


tatatttctc 


tgtcaaattt 


20520 


catgttattg 


ttctcttgaa 


attgttatct 


gctaataata 


acctgatact 


gtctcatttc 


20580 




ataccatatg 


gagtgggagg 


agagactgat 


gcctgtcctt 


agtccagaat 


ttcaacctct 


20640 


25 


cttaggaagt 


catgaaactc 


tgtacccttg 


tettctgeaa 


tgccacattt 


tccgagtttg 


20700 




cctgctatct 


tccttaccct 


ctttttcata 


atcctctggg 


aaatcttatt 


tctgattata 


20760 


30 


ctgtctctat 


cctctagaaa 


aagggtttga 


cgttttctgc 


ctccacccac 


aagcctgggt 


20820 


gacctcatcc 


etctgggega 


agagctgttg 


ggtttacctc 


tatagecatg 


ttctcattcc 


20880 




ctagctcctg 


ccagttatcc 


acttcttgtc 


aactttattt 


gaaggtccct 


acacaactct 


20940 


35 


gatcaaatac 


gttcaaaatt 


ttactatttg 


ttcctgcagt 


tctgtcctgg 


tccatgtcta 


21000 




ttgccttact 


gaatggctct 


atcactcatt 


cagtttccta 


acccagaatc 


tgagtgtcat 


21060 


40 


cctttcctgt 


tttttctcct 


gatttacaca 


tccattatac 


caacaagtac 


tgtcatttct 


21120 


tgttcctaag 


catctttcaa 


atccattcat 


tctctccaac 


ttcactagca 


tcacaggtag 


21180 




ttcagtaacc 


atcatccctt 


ccctagagaa 


ctgcctctac 


ettgagecta 


attctctgat 


21240 


45 


tttattatta 


tttattcctg 


ttgtctgaaa 


ccaggacaat 


atttctgaaa 


tataatctat 


21300 




tttaattttg 


agaategtaa 


tttcagatta 


tagaccacat 


agtttttgga 


attaagtaaa 


21360 


50 


actggatttc 


aatctcatct 


ataccagcta 


tacgaatatc 


tctgagtttc 


agtttcttct 


21420 


tctgaaaaat 


agaaataata 


atattacctt 


ccttcattcc 


ctttccattt 


gtgtgtcttt 


21480 




tgctcattca 


tcaggtctcc 


aattagatgt 


ccccttatct 


aggaagtctc 


ctctgcctac 


21540 


55 


rraat"al*t era 


t - ra t" f era t 






aatatcctac 


tttacrttar 


21600 

** JL V V W 




agatcactga 


ttacaatggg 


ttgtaatgat 


gtatttactc 


atctgtattg 


ccactttacc 


21660 


60 


ttgcagataa 


ctgatagtaa 


tggattgtag 


tgatttattt 


actggtctgt 


ttttccaagt 


21720 


agtctataaa 


ctctttgagt 


gaaaattatt 


ttttaccatt 


ttttcagaag 


cacacggcac 


21780 
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5 



15 



25 



actgcttggc 


ttatagtagt 


ttttctaaag 


actttgcttt 


cctcctccat 


gtaaagtagg 


ttcaggagat 


taacagggcc 


atgcaagaaa 


accagctcgt 


tctatgatac 


cagctgcttc 


atacaaccaa 


cttgttgcca 


ttttatggcc 


catgtaaata 


aaacatttaa 


gttgtgtata 


ttggccataa 


aataatggca 


aaaactgcaa 


atagtagtgc 


tccaggaaga 


caagagtact 


atttacaaaa 


taccatttac 


ttgtttatat 


taaatcctct 


accactgata 


tccacagtaa 


tatcaggcag 


cccctgcttt 


ttcagcaaat 


aacacaaaat 


cagttaatgc 


ttccgatata 


tctcagcaca 


gcctttaagg 


ttccaaacat 



agtgaggtgt ggagtggggg aagttgattg 
30 cccagagccc tgcatgcatc agacctacag 
ccagggactt gccttcctag gctttcgggc 
atttctggga ggggaatgag aagaataatg 

35 

cagggttggg gtgggttgag tgcaccttta 
gtttttgcta cagtcgatag gagttagaaa 
40 ataaggttgt ttagttactt ttgtcccctg 
acattttaaa cctttgacta gttacctcat 
tattctctgt gtctgactgg ttatttttat 

45 

tgcagaattt ggtgtacatg gagtagggca 
atgggctgtg gcctgaccta gttctccagt 
50 gcagtaaact gaggctgctc atgataaaat 
agcattctta ggataagaaa gaagcttgta 
gatgaagtaa aagaagccaa taaaagggaa 

55 

aaagggctgg gagaaaaaca aaaacaaaaa 
gcctttcccc cttttccctt cccatgccac 
60 ccatccagat tcaggcctca ttttgggttt 



aaaatgctga atgaatgaaa actgcacacc 21840 

cattatttta ttttacaaat gaatcttagg 21900 

tgtttgccat tggtttgatc ttagtcaaag 21960 

cttcacttag aaaggtgttg ttttattgcc 22020 

actcctgatg aatctctaaa ggtggaattt 22080 

ttaaggttgg tgcaaaagtt aattgtgatt 22140 

ttacttttgc accaacctaa taaaatagta 22200 

gtggaaatgt gtgaccctgt gaaagggggt 22260 

aacgactttt gaagagagtc tggcatacag 22320 

ttagcagtat tattttcata ggaaagactg 22380 

ttcaaaatag acatccacac ccatattggg 22440 

aaagaggaac tagctgtgcg tctcagaaat 22500 

ctgctagaag aggaatgcag atttaaactg 22560 

ggtctagacc aaagaacttt gaggaacttg 22620 

cagacattgc aggcctgaag aaaggtaggt 22680 

tgatggcttg tttcacttcc tgggttaaga 22740 

aggccacttc ccatgaccca agctggtatt 22800 

aatgtgcatt cggtgggggg atgaggaaca 22860 

agcctcctaa aggctgacat tggaggctgg 22 920 

agtcttcagg atatcaaaac actgaaagca 22980 

ctactgttaa tgcccttctt ctgtggtctt 23040 

aactcaggat ttaagaacag atgtatcaca 23100 

caggcagggt ccccagcctt gtacatttag 23160 

ggtctttggc actattcatg gggcacccag 23220 

ctgctaatga aaaacacctt gtagactttt 23280 

agtggtcgtg gccattgtaa tctttcccca 23340 

ataatgtacc cagaatcaca caagttaaca 23400 

caaaaacaga attcttggtt cctagcccgg 23460 

cacttctgca tccccatcat ccttctaaaa 23520 

ctagtctatt tgacaacttt gccaagaaaa 23580 
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acggatggct gtttgtacca ctctcctcat 
aggatggtga cataaaaatg aggaaataag 
5 actggggcca atagaagagg gaagggtggt 
tgggggaaag acattgtatt tcttaagagc 
aggtttaaat tcttctccaa cctggaagaa 

10 

cagttaaaaa aaaaattgaa tatagagaag 
tcttggggac ttcatgtgcc gtagataatt 
15 gaggcagtta tatgaattta tggccagagt 
tgaccagaac ctgacctatg ttaccaaaga 
gctttatttt gatataattg ctctgccctc 

20 

ctaaacacag agagagaacc accagtgcgt 
tctcagtctg aaggtgtgcc tgagagattg 
25 taaagtacac atttagatgg actaagaaga 
gacttttgca atgtagaagg aacataacac 
ctgactaatc cacatgtggg caatggtggg 

30 

agagactata aaatgctgtt catgtgtgag 
gttcttacaa ggtggtcaca agaggggtgg 
35 agatgtctcg atataggaat gaaattacgt 
cttggagctt taaaaattaa aaggagagag 
tgaagtagcc gagcttttgt tatcaacatt 

40 

aaatataaaa gaaggctttt aaaggggatg 
ctggctacaa catcagtggt ggattgaata 
45 tcagaggcta taatgaatac ttctgataaa 
ggagaaatac acagagttga aaaatggttg 
agagggtagg caatggaagg aaccaatgac 

50 

cactgtgttg cccaggctgg agtaccctga 
cagcctcctg agtagctggg attacaggcg 
55 tttgtgtaga gatggggttt caccatgttc 
gcggtcctcc caccttggcc tcctatactg 
agctttaaac ttccagattt ttaacatgga 

60 

agaaaggaca cttgtaattt catagccctc 
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gctctctcct 


tgcaattgtg 


acccatgagg 


23640 


agactggtgg 


tgctgctgat 


ccacaaggtg 


23700 


ctgggtttcc 


ccacctcttt 


atcaggggtg 


23760 


ctgatgcata 


taggtgatgt 


ggcaaaacca 


23820 


atgtccctca 


tatgcatcca 


atattctcag 


23880 


tcaagccctg 


gatggtaaac 


acgtttttgt 


23940 


aacacaagga 


tttctgtggt 


ggaaaaagag 


24000 


ttcttagaat 


gacaaagcac 


cagccttgaa 


24060 


attatgggaa 


agcataggag 


tcagttacaa 


24120 


gaggaagctt 


acagtcttac 


cagagaggca 


24180 


ttgataagac 


agacccaaca 


tgtcctaaat 


24240 


aggcagaggt 


tcttttacat 


taaagaagaa 


24300 


atcatcttta 


gtcactttcc 


aaataacttt 


24360 


atttcttaaa 


cccttattta 


ccgtcatagg 


24420 


gaagtggggg 


aatagtgtct 


gagagaagat 


24480 


aagattagat 


gatgtggtgg 


gtttgttttt 


24540 


aacattcctg 


caaatggttt 


caatatatgc 


24600 


ctttggaaca 


acttaaataa 


gtcaaatata 


24660 


attcgaggta 


agactacaat 


agatttgtgc 


24720 


aaaaatttta 


actttatgct 


aagggtggtg 


24780 


gtgactcaat 


tgtttttgta 


tttgatcatt 


24840 


tccaaagcca 


caaatggaca 


cagggaacat 


24900 


aatagaataa 


tggcttgcct 


tgagaagatg 


24960 


tgtaggatta 


gaatgagatt 


gattgagcat 


25020 


aaactatcag 


atatttttga 


gacagggtct 


25080 


tgccattgtg 


gcttactgca 


gcctgggcct 


25140 


cccaccacca 


cacccagcta 


atttttgcat 


25200 


cccagtctga 


tctcaaactc 


ctgggctcca 


25260 


ctgggattac 


aggattgagt 


caaaatgccc 


25320 


tagtgtagtt 


aattgctgga 


ccttccacca 


25380 


ttagggttcc 


tgttcccttt 


agctcttgcc 


25440 
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actcttgatt agcaactact tacagggctc 
ccaccttact tttttgaaag ggaaattttg 

5 

tctaaggttt gaaataaaaa taacctcttc 
atttatgggc tttattagaa tgactttaaa 
10 tgtaacaaaa tttgaggatg tggattaaat 
tagccatatg ctccaaatct aaattttgtg 
taggtctgaa tatacctact ccatcgcata 

15 

tcattggaac agagactgca agaggaagac 
ttgttatcta acttattcca aattgggtgg 
20 tgctaagaaa attaattgta caagtttttg 
tagtttagtg gatcccctta agcaaaggag 
ttggccctgc cactgtttac tcaagttact 

25 

gaataataga atatatctct caggggaatt 
aaaccagtgc gtgtcactta ataaatgcta 
30 ctactatcac ttactcattt ctggcttgaa 
ctgccactca aggacatgct agacagggac 
tctccggtct tggggaaaat agaaaagcag 

35 

atgtttttca tggtaccatg tggtcagagg 
caaagattgg tcagaagagt caggattgtc 
40 aagactatgc agaggcgggc ctagaggctc 
gataatggag tttatgaggt gggttacatt 
tatatcgaca ctttagcatc ctctgagcag 

45 

taaaataaaa tagcaggcct tattttttgt 
ttaaacatca gagaactgtc cccctctggg 
50 taaattaaca atttaccaac tgccccagaa 
tagaagacct aaatagaaga tattcggcct 
gtgttacagg aaaagtatgt ggaaattatt 

55 

agtggcaaga actcaacgta acagaaaccc 
tcttggtgga aggcccagca attaactaca 
60 tttgggcatg gtggctcact cttgtaatcc 
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tgaaataatc 


tccacatagg 


aagatttgcg 


25500 


aggaagactg 


gatgtcttct 


atttaggtct 


25560 


tcccttctgg 


gacaagtagg 


taaattgtca 


25620 


ttaagcccag 


agaaggacaa 


ttcccttaag 


25680 


gtaagaccaa 


aaataagacc 


tattttattt 


25740 


ttctgtctag 


gaagtgctaa 


agtgtatata 


25800 


taaatgcaaa 


ccacctcaca 


aattccatta 


25860 


aagggcctct 


tggcccacct 


ctacgtagtc 


25920 


tttctacctt 


ttagctatta 


tagataatgc 


25980 


tgcaaacaag 


agatcttcca 


ttcaccagac 


26040 


ttctggaatt 


acatcacctg 


gatttaaatt 


26100 


taatcttgtg 


tttctataat 


tctgtcctgt 


26160 


taatgatgaa 


ataacacata 


taaaacattt 


26220 


aatacatatt 


gactattatt 


actggtattc 


26280 


gtataaaaca 


taaacttcct 


ttaacacttc 


26340 


tttactcctg 


tcatgaccct 


gctccaacct 


26400 


ataagagggg 


tctatataat 


tggtgagagc 


26460 


ttgactcttc 


ctctcttttg 


cagtttgtct 


26520 


ctcctttttt 


gctctcagta 


agttagatac 


26580 


tcctcttcct 


cttgcagtct 


ctgttctaat 


26640 


cacatgctat 


ggagtaggta 


cattcagacc 


26700 


aacacaaaat 


ttagatatgg 


atcaaagggg 


26760 


cttaaattta 


atttgtactc 


gcaaattttg 


26820 


cttaatttaa 


agtcattcta 


ataaagccca 


26880 


gggagaagag 


gtaattttta 


tttcaaacat 


26940 


tcctcaaaat 


ttctctttca 


aaaaagtaag 


27000 


tcagaaccct 


acaagcagcc 


tctaatcaag 


27060 


taagagggct 


atgggactgc 


aagtgtcctt 


27120 


ttgtctgtgt 


taaaaatctg 


taagtttaag 


27180 


cagcactact 


ggatgctgag 


acgggaagat 


27240 
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cacttgagcc ctggagtttg agaccagctt 
aaaatacaaa aattagctgg gtgtggtggc 
5 ctgaggtcga ggctgcagtg agtcacgata 
atgagaccct gtctcaaaaa agtctggtag 
aacataggtt cttttctatt ttccctaagt 

10 

gtacaaaaga gagaaatttt acagctaggc 
accgtgatgc ccccgagctg caaaaccagc 
1 5 gaatgtacaa atagggagtg ggtcacagag 
acaagggcag aagggcagag caaggtcaca 
ggttccatgt cctgctgtgc acacattgtc 

20 

tcaagagcag agaaccagtc tgactacaat 
agcctgaggg tgctgcagga gaccagggtg 
25 gcagacaccc ccagagcggc tgtccatagg 
gttattcctt actgggaaaa gaattcagtg 
aaaaatatga ctttgttctg cccggccccg 

30 

tgtttcctga acatcgctgt tatcctgttc 
caaacacaca tgttttacaa tttgtgcagt 
35 cgtacatctt cagcttacga agatgacggg 
gaaattataa aagtattgat tggggaagtg 
attcagagat tgcagtaaag acaggcataa 

40 

aatgaatgtc catgaaatct tcacaattta 
tccgttcggg gtccctgact tcccgcaaca 
45 accctctatg ctcagtcaat cattaaatga 
gacttgaagt ttattttaca tgatctaagt 
tgcatggaac agtccatccc ttaactttca 

50 

ctcttacagc atctggttag atcttgtttt 
tgttcaaaga tttaattcat ttatatttaa 
55 tgacatttta ttattttggg ggggtttgtt 
tggtctacct ttgtgattta gtaattttat 
tgtgtatctg ctgtagttct ttgccttgta 

60 

gttttataaa ttgactattt taaggtgaca 
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gggtaacata 


gagaaacccc 


atctctacaa 


27300 


acaggcctgt 


. aatcccagct 


acttgggtgg 


27360 


gtgccagtgt 


actccagcct 


gggcaacaga 


27420 


tttattggag 


gaaccagtcc 


ccaatatttc 


27480 


gtcagccagt 


ctgagaaata 


aagagaatga 


27540 


ctccgggggt 


gacatcacat 


atcagcaggc 


27600 


aagtttttat 


tagggatttc 


aaaagacgag 


27660 


atcacatgct 


tcaaatggca 


ataaaagatc 


27720 


aggccagggt 


gaaattagaa 


ttactaatga 


27780 


attgataaac 


atcttaacag 


gaaacagggt 


27840 


tcaccaggct 


ggaatttcct 


aatcctagca 


27900 


tatttcatcc 


cttatcttca 


atcgcataag 


27960 


caccccctgg 


gaatgcattc 


tcttcccagg 


28020 


atatttctcc 


tacttgtttt 


ctgcaataag 


28080 


caggcagtca 


gaccttatgg 


ttatctccct 


28140 


tttttcaggg 


tgcccagatt 


tcatattgtt 


28200 


taacgcaagc 


atcacagggt 


cgtgaggcaa 


28260 


attaagaaat 


taaagtaaag 


acaggcatag 


28320 


ataaatgtcc 


atgaaatctt 


cacaatttat 


28380 


gatattataa 


aagtattaat 


ttggggaact 


28440 


tgttcttcag 


ccatggcttc 


agccggtccc 


28500 


gtagtttgtt 


attaatgact 


tctattaccc 


28560 


tgaccttctt 


tgcctctttt 


gacagttttt 


28620 


acaggtaatt 


attctcttta 


ggctagtctt 


28680 


actatgtttc 


ttcttaaggc 


taaagtgagt 


28740 


ttaaaaattt 


tccagccgct 


ctttgacttt 


28800 


ggttagcatt 


aatggataag 


gatttactag 


28860 
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gaattgctaa 


gctttgaatt 


ctttaacgtt 


28980 


ctcatcatta 


ggcttacata 


aaaccacgta 


29040 


ataacaatgt 


ttgttgaaac 


aagaactcta 


29100 
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gacttgtact gtcctcccac aaaactaaat ttttgatatc tttgttttct gctgctgtaa 29160 

aaaaccattg aagactgggt aatttattat aaaattgtat tttgcttata gctctagagg 29220 

5 

ctgggaagtc taaaggcatg catggtgtaa acctctagtg aggataatcc catgaaggaa 29280 

agtggaagtg agcgtacaaa gcagagagag agatggggcc aaatttattc ttttttcagg 29340 

10 agtccactcc tgtgataact aaccttctcc aaggataatg acattcatcc attcatgcag 29400 

gtagaggtct cctgatgtaa ttacctcttg aaggtcccac ctcttaatac tgttaaaatg 29460 

gcaattaagt tttcaaatga gttttagagg ggacattgaa accatagcat ctcatctaca 29520 

15 

accctccaaa actcatgttc ttctcacata caaaatacaa tacttctgtc tcaatagcct 29580 

caaaattttt acccattcca gcattgactc taaagttcta agtccagagt cacatcagaa 29640 

20 gcagatatgg atgagactca aggtatgatt tatctggagg caaatgtttc cagctgtgag 29700 

cctgtgaaat caaaacaagt gatttgtctt caaaatataa tggtgaaaca ggtgcagaat 2 9760 

agactttctc agttcaaaag aaataactag gcaagaaaaa ggtgtaagtg gtttcaagtg 2 982 0 

25 

agtccaaaac ccaaccaaga aaacagcatt aagctttann nnnnnnnnnn nnnnnnnnnn 29880 

nnnnnnnnnn nnnnnnnnnn caccttttct gctgcccacc ttttctgctg ccatgacaac 2 9940 

30 catgcaagga atggaacagg ccatgccagg ggctggccct ggtgtgcccc agctgggaaa 30000 

catggctgtc atacattcac atctgtggaa aggattgcaa gagaagttct tgaagggaga 30060 

acccaaagtc cttggggtaa gttgcaaatc taggggaaga ccaatggtgt tgcaaactca 30120 

35 

tatttcataa gacctaatgc caaatacact gtcccactag cctcataata caacagccaa 30180 

ccctcacgac gtcagtaatt acttttcagg ctcttcatct gaaaaactat atagtattct 30240 

40 gtcacaacat gcttccagtt gagttatatt taaaatatat tgagttcaga tctattctag 30300 

ggttgtgggt catccatgat aattgtattt aaactcagtg tggaaaagca gcaaaataaa 30360 

aggccaccta tgcactacta gctgacgccc cctttacagg ccttcctgat agctgacagg 3042 0 

45 

cacatattgc tactatttct ttctggcacc ttttcttagc cccctgctca gaggatgaaa 30480 

aatctatcct ctgagcattt ccatctggag tctcagatga gcatgcccaa aagaaatgaa 30540 

50 ggattaaaag aatgacagaa tagggcaggt gaaaggatat nnnnnnnnnn nnnnnnnnnn 30600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnntaaatt aggtttatca ctagcccaag 30660 

cttcaaacca cggngaaaat gtcactgtgg atattctgga tccttttttc acattattca 30720 

55 

ttttctggag ggggggaaat atagttattt ctcataaaat atgttattta tgttggcaca 30780 

tactaggttt gtttgtttat ttatttttaa ttttattttt tgagatggag tctcgctcca 30840 

60 tcatccaggc tggagtgcag tggcacaatc tgggctcact gcaacctccg tcttccgggt 30900 
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tcaagagatt ctcctgcctc agcctcctga 
acctggctaa tttttgtatt tttagtagag 
5 cttgaactcc tgacctcgtg atctgcctgc 
atgtatgaat aattatttta aaatatactg 
aaaattgaga gcaaagtaca gatctatccc 

10 

tttttaatta tcaacctcct ccaccagcat 
tgacacatca ttactaaaag tacacaaatc 
15 tctgtgggtt ttaacacaag tatagtgata 
cccaccaatc cctggcaacc acaaatcttt 
agtcatacag ttggaattgt acaataggtg 

20 

tacatgcatt taaatttcct ccatgttttc 
actgaataat atttcattgt ctagaagcac 
25 ggacatcttg atggcttccc aactttggca 
gtgcaggtca tgtgtggaca taagtttttc 
tgtaagtaag agtacgttta gttttgtaag 

30 

gtcactttga agatttcaat caacagtcaa 
caattggtag tgccagtgtt ctggattttt 
35 ctatcatgtt aatttgcatt tcccttgtga 
tctttaccat ctctatatct tctttcgtaa 
taatgaggac gtttatgttc ttattgttaa 

40 

gatcctttgt cagatatgac tcttgcaaat 
atctcttgac aatgtcttct gcagagcaga 
45 ttattctgta tttcatggtt tgtgcctttg 
caaggtcatc tagattttgg cctatgttat 
tatttatgtc tgtgattcat tttcagttaa 

50 

ttattcttac ttatggatat caagttttaa 
ttgtattact ttagctcatt accaaagatc 
55 gctttctact ctattccatt atctatttgt 
attactatca ctttatagta tgtcctaaaa 
tcttcttctt cttcttcttc ttcttcttct 

60 

ttgtgtttgc tattctgagt tttttgccat 
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gtagttggga 


ttacaggcat 


gagccatcac 


30960 


acggggtttc 


agcatgttgg 


tcaggctggt 


31020 


ttcagcctcc 


caaagtgggt 


ttatttttta 


31080 


tatgttttag 


agcagtttta 


ggcttaaaga 


31140 


atatgcccct 


tgcccccaca 


aatgcatagc 


31200 


gtaacctttg 


taccaattga 


tgaactacat 


31260 


acaatggggt 


tcattctctg 


tgttgtgtgt 


31320 


tgaatctacc 


tttgtagtat 


cacatagtat 


31380 


tattgtcttc 


atagtttgcc 


ttttccagaa 


31440 


ttttttttag 


attgcgttct 


ttcacctaaa 


31500 


ttatggcttg 


atagctcatt 


tatttttagt 


31560 


cacagtttat 


ttatgcattc 


acctactgaa 


31620 


attaggagta 


acattgctat 


aaatatctgt 


31680 


aactcatttg gttaagtatg attagtgtat 


31740 


aaactgccat 


actgtcttcc 


acatggcctt 


31800 


tgagttcctg ctgttctgca 


aaatctccag 


31860 


accattctaa 


caggtgctag 


tggaatctca 


31920 


cataagatgt 


ggaccatctt 


ttcatgcgtt 


31980 


ggtgtcttgt 


taaattcttt 


ggcacatttt 


32040 


gttttaaatt 


tgttgtatat 


tttgggtaac 


32100 


atcttctccc 


agtctgttgt 


ttgtcttttc 


32160 


actttttaat 


tttaagaaaa 


ttaagatgat 


32220 


gtatcgtatt 


gacaaattca 


tcaccaaacc 


32280 


ctgttaggag 


ttttatagtt 


tttagtttaa 


32340 


tttttgtgaa 


agatataagg 


tatgtgtctt 


32400 


gtactatttt 


ttaaaatact 


ttttttttca 


32460 


acttgactat 


atttgtgtgg 


ctctctccag 


32520 


ctatttcttt gctaatacca 


cacagtcctg 


32580 


ttgggtagtg 


ttagtcctac 


aactactact 


32640 


tcttcttctt 


cttcttcttc 


ttcttctaga 


32700 


tccttataaa 


ttttagagtc 


agtttatcaa 


32760 
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tatatacaaa ataacttgct gggattttaa 
tggaaagaac tgacatcttg acaatgttga 

5 

tccatttatt ttatcaaagt ttgtagtttc 
tatacctaac aaatgccaaa taccaatacc 
10 aaatggcatt atgtttttaa ttccaaattt 
aattgtctct tgtgtattaa tcttgtataa 
gttccaggaa tttttggttg attctttcag 

15 

aacaaaggca gttttatttt tccctgacca 
ttactgtttc tgctatgact tccagtacaa 
20 ctgccttgtt cctgatctta gtgagaaagc 
ctgtaggttt ttcacaaata ttctttatca 
ttttcaagag tttttataat aaacaggcat 

25 

gttaatataa tcatgtgatt tttcttcttt 
gttttttgaa tgtatagtca gtcttgcata 
30 acaatttttt tgatgcattg ttagatttag 
tatttgttca tcagagatgt tggtctgtag 
ttttggtaga tggtagtgct agcttcagag 

35 

attttctgta aataatggta taattttttt 
tccttctagg tctgatgctt tctgttttgg 
40 aataattata ggcttattca gagtgtctct 
aggagttatc cattttatca aggctatcaa 
ttttattatc cttttaattt ctatgggaac 

45 

agtaattttt gtcctctctc tttttcttag 
tacatacata tatttttaaa aactgctttg 
50 gttttcaatt ttatgatttc tgctctaatt 
agtttaattt gcacttcttt ttctagtttc 
gatcatgatt ttctaatata ttcattcaat 

55 

actacatccc ataaattttg ataagttgtt 
ggataatggt actcttactc tgctacgttt 
60 ataccccaga cttcaagaaa atatcatatt 



tggagattgt 


attaaatcta 


ctgatcaagt 


32820 


gtcttcctat 


ccatggacat 


ggaatatctc 


32880 


ttatagattt 


tgtacatttt 


tttttagatt 


32940 


taaatatttt 


atttttagag gtgctaattt 


33000 


cacttattca 


ttataagtac 


acaagaaagu 


33060 


atcttgtaaa 


cctaatataa 




33120 


attttctaca 


cagacatcat 


gtcgtctggg 


33180 


gtctgtatat 


ctttatttcc 


ttttcttgtc 


33240 


tgttgaaagg 


agtagtgaca 


gggctcatcc 


33300 


tttgagtttc 


tccattaagt 


atgaagttag 


33360 


acctgaagaa 


gttctcctct 


attaataggt 


33420 


tggattttgt 


tcaatatttt 


ttctgcgtct 


33480 


agcctgttga 


tgtgatgaat 


acattaatcg 


33540 


cctgggataa 


atcccaagtg gtcatggtac 


33600 


tttgcagata 


ttttgttgag gatttttgca 


33660 


ttttcttttc 


ttgtaatgtt 


tttgaatgat 


33720 


aattagttaa 


ggagtatgtc 


ctctgtttct 


33780 


cttaagtgtc 


tggtagaatt 


caccactgag 


33840 


aaatttatta 


attattgatt 


caatttattt 


33900 


tcttgtgtga 


gttttgacaa 


attgtcttta 


33960 


actgtgggcg 


tagaattgtt 


cataacatta 


34020 


tgtattgatg 


ttccttcttt 


catttatgtt 


34080 


gctggctaga 


ggctttttaa 


attttattta 


34140 


gattttattg 


attttgttta 


ttcatgttct 


34200 


tctattatag 


ctcttcttct 


gtttatttgg 


34260 


ttaaggtaga 


ctctcaggag attgatttta 


34320 


gctatatatc 


ttcctctaag 


cattgctttc 


34380 


gaaatgtttt 


tgatcctttg gaatgccatg 


34440 


agatccctgt 


gtactcatac 


ttgtgtcctc 


34500 


tgcttttctt 


gttttcttcc 


cttgtatata 


34560 
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ctctcctttc ttcaggttac ttctccttgc 
cttctctgag ctcatgagaa aaaaagagaa 
5 tcagaaaagc acctctgggc tggttctggc 
atgatagacg tcgcgagaga taatattggt 
aaagagtgat aagagaatac taagaaggtc 

10 

tagcttagtg attggaaagg gggtggcgga 
ttgctttctt caccattttt aggttgtgca 
15 gggaataaca atgatgtgta tggcatctaa 
tatcgggtac acaatttggg ggtcagtaat 
agataacata aagcagttgt caataccctg 

20 

gtaattctgt aaatctgaga gtggtatcta 
aagtactaca aataatatag tattatatta 
25 cagttagata tattacagca gcacattgcc 
ggaagcactg attatttcac aacaagacta 
cagtggtgca aggatttcat caaggattgc 

30 

acacatccaa cttttggtaa atatttcata 
ataacggact tgccattatt ggcttggact 
35 gtctacccta cctttagctc atgagaaact 
gaagaggaga tgtggctggt acacaagcag 
actcaggaaa ttgttacctc agaaaatgaa 

40 

ttcaataaaa taactaatta ttataattaa 
tacattaaaa ttagaattgg aatttaaatt 
45 atagttttaa tgcaaattga gagtaaaatt 
agagagaaaa ataaaattta taagttgggt 
attgcagggt ttccatttat ttctaaaact 

50 

gagcttgctt tttgatcccc ttcacctgtt 
taaagctgtg ttttacttct agaaagctgc 
55 gccaataaca gctcttatta aactcttact 
atgtgcaatt tcatttcaaa caatcaccac 
atggatgaag agaggttaag taactaaccc 

60 

gattcaaata gggtcaatgt attttcagaa 
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tacaataatt 


gctttataat 


tcctaggcca 


34620 


ataattgtgg 


tagaagtaac 


atcgtaggta 


34680 


atactccttt 


taaccttgaa 


attattggca 


34740 


aagattacat 


tcacggttat 


ttatgtgaaa 


34800 


atatgactat 


caaatgaagg 


agaataagga 


34860 


agcaccattt 


ttgtggtgga 


caatcagttt 


34920 


gattctgact 


gccctgatga 


gccttagcat 


34980 


tacttatgga 


agtaacccta 


tttccgtgta 


35040 


ggtgagtaga 


gtatcttttg 


atataattga 


35100 


atctttggaa 


tcctattcta 


accgtatctt 


35160 


accatttctt 


actcaatttt 


ctcatattta 


35220 


aatctaggca gattaaatat 


ttatatatgt 


35280 


aaagacccaa 


accagaatgg 


cttaaacatt 


35340 


gtggcttatc atttctagga 


tggcttcatt 


35400 


tagcagacca 


catcagaggg 


gagatggttg 


35460 


tacatattcc 


caaatcaact 


gtcagtaagg 


35520 


aatcaagatt 


tacctctgta 


gctgggagga 


35580 


gaacaaaatg 


atggtttagt 


tagcaaggaa 


35640 


tttctaatat 


ctagtctact 


gactatcgaa 


35700 


aatgttattt 


gatatatttt 


gtttcataaa 


35760 


tattaaatta 


attataacta 


ttattaatat 


35820 


taaaaacctt 


ctcccaagag 


aaaacacaag 


35880 


gagaataaaa 


taccaagttt 


tatccattga 


35940 


ttatgttaga 


atcagggaat 


ctattctagc 


36000 


gagaatgagg 


tagtgtagag 


ttttatagaa 


36060 


tacagcagaa gtgcaagatt 


ggaagaacag 


36120 


acaatacttc 


atatatccaa 


aaatactgat 


36180 


atcatgtgga 


cactctactt 


y y 


3624 0 


gaacttctga 


tttttcatta 


taatcatttt 


36300 


aacatcacaa 


ctgcttggta 


caggaatgaa 


36360 


ttgcatacat 


ctaactactt 


agttttctgt 


36420 
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ttttatttcc gtttcctttt caagatcaca 
ccacacatgt tcagtagatt accaggaact 

5 

tgagactttc gtaggtgttc cattagatga 
cggctacctt ccattattct ttcaaccttt 
10 tttttctctt tcatacagta taagtaccaa 
ttttggcctc ttcagttctg tttctgtata 
atagtacata catacttttt cctgtggcaa 

15 

aatacacaga catacataca cacttgtaat 
ttccaaggtg ccttattgaa tttttagaat 
20 gaatgtttgc aatatatatt gttcatactt 
ctatttttct tatgtaatta ttttggtaaa 
gcatctttat cttgattttg actttaatga 

25 

ataaagactt ttagactgag acagatttct 
tttttttttt tttggttttt ggtttttggt 
30 caggctggag tgcaatggca cgatctcggc 
ccattctcct gcctcagcct cccaagtagc 
gctaattttt ttgtattttt agtacagacg 

35 

tcgtgatcca cccgcctcag cctcccaaag 
cctgccaata attctttata ttttaccagt 
40 tattaaatgt cctagcatat caacactgta 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnntgt ttgtagagct 

45 

tttatatcat atgatgggtg attccctcta 
catgaatcat ctctatgtct gggtatttgg 
50 tagttctgta ttttttatat tatgatataa 
agcaaattaa aaactccaac aattgaatat 
agttatttta gctaaatgaa aaataattgc 

55 

tctagtcatc atctaattgg gaatttacat 
aaaatattgg ccgggcgcag tggctcacgc 
60 cgggcggatc acgaggtcag gagatcaaga 
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ttaaaaacca 


gcttacatgg 


tgcaagcctt 


36480 


gcaggagcca 


tttgtccgct 


gagacatcac 


36540 


gcctttactt 


aattggtgaa 


ctcttttctg 


36600 


ttttccaaac 


ctcttctatg 


aagtgcctcc 


36660 


taccaagaac 


cagtcacaag 


ttcctcttat 


36720 


tgtttcacat 


tccttgtttt 


ttcctctgag 


36780 


atgaagtcct 


cttattcaat 


cccatcttca 


36840 


gtgtatatat 


aatttctgta 


tagaatgttt 


36900 


agtttatggt 


aatttttagt 


tcacgttctt 


36960 


aaataagaat 


agttaaacca 


cttgttccaa 


37020 


tatacccaga 


atagtattac 


ataatagtca 


37080 


aatgatccta 


catttttatc 


attaaacata 


37140 


atcatgttac 


agaaattaca 


attctttttt 


37200 


tttttgagac 


agagtttcgc 


tcttattacc 


37260 


tcactgcaac 


ctccgcctcc 


cgggttctca 


37320 


tggaactaca 


ggcacccgcc 


accacacccg 


37380 


gggtttcact 


gtggtctcga 


tctcctggtc 


37440 


tgctgggatt 


acaagcgtgt 


gccaccacgc 


37500 


aatgttgcag 


ttactaataa 


tgttgaattg 


37560 


aacatgaata 


gcaataagcc 


tgtnnnnnnn 


37620 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


37680 


gaaggaacat 


cccaaatgtt 


catgatgtat 


37740 


gtggttttat 


tccagtggtg 


atttagatat 


37800 


aagttaaata 


aatttattga 


gcttttttat 


37860 


tctttgaatc 


atctgaaatc 


aaatttgaaa 


37920 


taaaagatag 


agagtctaac 


caacttgact 


37980 


ctattgatgg 


aatcaaatga 


actactatat 


38040 


ttttttgcta 


ctattttgct 


tataagttta 


38100 


ctgtaatccc 


agcactttgg 


gaggccgagg 


38160 


ccatcctggc 


taacacggtg 


aaaccccgtc 


38220 
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tctactaaaa acacaaaaaa ttagcccagc 
tcgggaggct gaggcaggag aatggcgtga 
5 gattgcccca ctgcactcca gcctgggcga 
aaaaaaaaaa aannnaattc taaatgagaa 
gtcatcaatt ttggtaccat gtaagaatta 

10 

tagtgccctc aaacagtttt aatatcattt 
ctcccatgga tttatctggt attatttctc 
15 gcaaaccttt gtggttttct ttaagataat 
aattaagttt agtaagttat gactcccaat 
tctctcattt ttagtttatt atttcaggat 

20 

caaaaggcct ggtgagtaat attttctttt 
aaaatgtatt ctgccagggt gtaaaacagg 
25 gcgattgtgc attttaccta ctagaatctg 
tgacatagga ctggatattt tggcaaggta 
ttgaagatat agagaattat tgagaagaaa 

30 

gcagcatgag tcagcccatg atagtgacac 
cccattgatt ataatgtaga ctccaactgt 
35 agaaattcac aataaggtta ttacataaga 
gatgcctttc ataaagccaa tacaatgcga 
agttaaagtc catatttgag ttttctttgt 

40 

gatcaatatg ggttttccaa atctggccta 
agccacatta ctcatactcc tcacactttg 
45 ttttttctgg tttttggttt ttggtttttt 
tggagtgcaa tggcgtgatc tcggctcacc 
ctcctgcctt agcctcccga gtagctggga 

50 

tttttgtatt tttagtagag atagggtttc 
tgacctcaga ttatccaccc cgcttcggcc 
55 cactgcgccc agtcagcaat tttagttctt 
aagagaacgt acagaaaaga atatactgct 
catgacaggc aaacagcatt tagaaaaatg 

60 

ggcagggaag gacacagaca gggctgtcca 
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gtagtggcgg 


gctcctgtag 


tcccagctac 


38280 


acccgggagg 


cggagcttgc 


agtgagccga 


38340 


cagagcgaga 


ctccgtctca 


aaaaaaaaaa 


38400 


taattttagt 


gtattttttg 


tttaaatata 


38460 


ctttgtatca 


ggagtttgga 


aactatcttt 


38520 


cagttagata 


tttgaaaaac 


ttgataaatt 


38580 


tgttctttct 


gggaaaaggg 


aaaataatgt 


38640 


tggtctgatt 


aacatatcta 


tctaatttag 


38700 


actggcttaa 


agttttttac 


cttcattttt 


38760 


ccttgtcaat 


tgcagcagga 


attagaacta 


38820 


tttggtatca 


aaaaaaggaa 


ggattaataa 


38880 


ttttgtttct 


ttgtttgtaa 


aaaatcagga 


38940 


tctttcaaaa 


actgcagtca 


ggctaattgc 


39000 


gggaatctta 


cgcttgtgga 


gggaatcttt 


39060 


tatcaagtgg 


gaccgctcca 


acaaaaccct 


39120 


caagcattgg 


tgttgtgtaa 


tatttactga 


39180 


gtccagaaag 


gattttgagt 


tttcaataaa 


39240 


agcaaattta 


aaatgtaatt 


ctcatgccaa 


39300 


aagtccatat 


ctgaataaat 


gtttttttct 


39360 


tcaaaagata 


aatatcaaat 


tctggagcat 


39420 


acccaactcc 


ctagtgtccc 


ctccggctgc 


39480 


ctgtgcaagt 


ttagttcttt 


tttgtttgtt 


39540 


gagacagagt 


ttcactctta 


ttacccaggc 


39600 


gcaacctccg 


cttctcaggt 


tcaagtgatt 


39660 


tcacaggcat 


gtgccaccac 


gcccagctaa 


39720 


tccatgttgg 


tcagactggt 


ctcaaactcc 


39780 


tcccaaaatg 


ctgggattac 


aggcatgagc 


39840 
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gagagagata 


atatacaaaa 


ggtatagagg 


39960 


ataagaaatc 


aggcatcaga 


agcaaggagt 


40020 


gctaaagtgt 


gaaggacatt 


gaaaaccctg 


40080 
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ctaaagagat ttggagatta tattatgtgc 
cggaataatg agatcatgtt tgtgttttgc 

5 

gttagaagac tattgaaata atcaagataa 
gtgtaaagtg attcatggag atatatctga 
10 attcctttct aggtccgagg tagtctagga 
tcagggatct taatcaacac atttagcttg 
aactactatg gcaactcaaa taattgtcat 

15 

gcctcttttc aggggatatt attatagaag 
acaataatta atattcccta attgatgaat 
20 agaataatct ttcatttagg agttaggtac 
tatcatgagg caaaatggga agaaagctta 
actgtgctgg gtattttgta tacatttttc 

25 

tactgttatc ccctctaaaa atggggccct 
taaaattgtc ttgaacagta ttttacaagt 
30 tttggatgtt cttgcaaggt taaccacttt 
cagtcttcat tatgactttt actacttttt 
ctaattgctt tattctgaaa atgccaccta 

35 

gatattttgg gccattgggt aattgtgggg 
actaaaatgc acaattaaat ctttattatt 
40 cctgtaatcc cagcactttg ggaagatgag 
agaccagcct ggccaacatg atgaagcccc 
gcatgatggc acgtgcctgt gatcccagct 

45 

gaagctggga ggcggaggtt gcagtgagcc 
aacagagcga gactgcatct caaaaaaaag 
50 caaatttttt gttactttcc ctctatatga 
ctcttcctta ccacgttctt cgcttatgta 
actaagcaga atcagttgca ttctgtttac 

55 

tgggctatat aatcacacca aaataacccc 
cccttgagat ttctattgct ttttatagta 
60 ataagaaaaa gcttctatag agaatttgac 
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ccacgggaat 


catctgaaga 


attgaagctg 


40140 


caagagatta 


gagttgaaac 


cacaagacaa 


40200 


tgatagatgg 


cagtaagaaa 


gagatggtgg 


40260 


ggatgttgtt 


ggattgttta 


attgattttc 


40320 


atgaatatca 


ccagctctgt 


actggctgca 


40380 


gcgttttatt 


cattccatca 


cccttactgt 


40440 


gggactatgt 


ccatcttaat 


ggttggtact 


40500 


caagtttggg 


gagtgctggc 


tctggcaaag 


40560 


gactttggga 


agttggagtg 


attgagaaat 


40620 


atgtctcatc 


tctgtcttta 


ttcttctata 


40680 


tagttataga 


atacctactc 


attaatagtt 


40740 


tgtttaattt 


taaaacattg 


atgcagtacg 


40800 


gatgtcagtc 


cctattctct 


aataatgaag 


40860 


ggtgagtagc 


tcagtatcct 


ttccaacatc 


40920 


tatcctcaac 


atctagctcc 


tgataaatct 


4098.0 


tgtaccaatt 


gttcttatct 


taatgctatc 


41040 


cacaatattt 


tgtaggtagt 


aaaacttctt 


41100 


aggaaaagtt 


attgcagcac 


atcacagcag 


41160 


aaaaagcctg 


gcagggcgca 


gtggctcacg 


41220 


gtgggtgggt 


cacctgaggt 


caggagttca 


41280 


gtctctacta 


aaaatacaaa 


aattagccag 


41340 


actcgggggg 


ctgaggcagg 


agaattgctt 


41400 


gagatggaca 


cattgcactc 


cagcctgggc 


41460 


agaaagccta 


ttggtcattt 


tgaacacctg 


41520 


actttcagcc 


tttcaccttg 


ctgcttagtt 


41580 


aacgtggaaa 


cccaacattc 


ttcctaactt 


41640 


ttcattccta 


gctaattcta 


catagttttg 


41700 


ttgaggtgtc 


tttgctaaac 


taatttactt 


41760 


agcaatttta 


cttcttgcca 


ctcgctcaaa 


41820 


aaattctttc 


aaggtctttc 


agttagaaac 


41880 
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tggcaaaagg aagattagat tctacaattt 
ttgtatcctg cagaaggacc ttacttttag 
5 aagggcacaa tgatagagac aaaatctgga 
ggcatgggac ctgtgcagtt actcaggact 
aatgctctgc tgttgctatc ttaaaattgt 
ttacatagct ggtcctccat tcatctcaat 
gttaattgtg tgaacatctt tatgcctgtt 
15 ctgggatttg aagtaatggt tctattcttg 
ctatatacat gctacctttt tttggaggcc 
ttcataaata aagtttaatt gaaatacaac 

20 

ctgctttcgt gttataacag caggtttgag 
aatctaaaat gtacattatc tggcctttac 
25 ctagatggta aaatccttga agacagggga 
tgtgcctaca acaatgttgg aaacacagag 
taaataagca atgaatgcaa atattctcaa 

30 

tgagctcttg gtctattggc tcttctgcaa 
aaagagagaa tatttaaacc agataatccc 
35 taagtgtact gcccaatgtt ttccaaattt 
nnnnnnnnnn nnnnnnnngt ctggatggca 
gcattgctgt gtccctctct gcctttggat 

40 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
aagcagtggt tttggggcat atgatagcta 
45 ■ ccctatgtca cactgacttt gtggaagaat 
tgggccagtg tggtttttgc aatgtattat 
ttgatgtcat cagcatgaac aacctggttt 

50 

cttggtttac ccaacgtact gtgcactgtc 
accagagcag tggtcatcag cagctcaggt 
55 atcaggcctg atcaagagag tgactgagaa 
agagaagaaa agagtaaact ctgataatga 
tgaggtgatt tgggtgactc acctttggca 

60 

tgccatcaca ttctcacatg gcagaaacag 
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cttgtctcta attcccctat tatttgctat 41940 

tttacgaagt tctcttcatt ggtagatttt 42000 

attggccaac ttgatagggc tgatttcgtg 42 060 

tcacagttaa aagagccctg tgcttgatct 42120 

ttttaaaaat tttaaacaag agaatctaaa 42180 

cacctttccc agagtccatg atgaattcct 42240 

cttatagcag aggacctcca taagggcttc 42300 

cctgggcacc tgcacaatca gtccacaggt 42360 

cagccctgta cctgtttttg taaataaagt 42420 

gaagcccatt tgtttacata ttgtggagtg 42480 

tatgtgtgac agagacctta tggtcggcaa 42540 

agaaagcttg ccgtcttcta gtctacactt 42600 

attgtctgtt ttgtttgctg ctctatcttc 42660 

gccactacag ctgttgaagg cacaagaaaa 42720 

ggtctcctta agtctgcctc actcatatcc 42780 

cagagggact atccctaacc atggagagaa 4284 0 

tgcatcaatc cgggggaaaa ttaatagaaa 42900 

tgaccatgtt tcttggctga agacnnnnnn 4 2960 

tggtgctcct cttaagtgtg ctggaattct 43020 

gtaaagtgct ctgttgtacc cctggtgggn 43080 

nnnnnnnnnn nnnnnnnnnn gaattcagaa 4 314 0 

ggtctctatc tcttcctgtt cttccaccaa 43200 

aaccagaact cttcttagtt ttgtgatgtg 43260 

tcataactag caaatcatga tttgggtatg 43320 

ggaagaagag ccccatattt ttcataacac 43380 

atccacacgt aatattgatc cacttgcgag 43440 

caagagctat cactgaatgt tgtgtcctaa 43500 

accccacata cttgatcttg agaatgaatc 43560 

tgactttatg tgggcaggaa aggggcactc 43620 

ttctgattgt ttcacaggtt gtgttaattc 43680 

catctcccac accacttaat gaggtttgag 43740 
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gccaccaaaa gatcaacaga caaatgctcc 
ctcacatgag aaattaccag tatccaactt 

5 

tatatgtaat ccaattatga actgtgtgtg 
tctcattttt ttccctggaa ctcaataact 
10 gaagttaaat taataaataa tgcatttaat 
atcataagaa ctttatataa aggcattaca 
aaaaaaaaga tattgttaaa atgaggcctc 

15 

tttaactttt tttttctcct acctatgggg 
gatgcatggc catctccccc tccctttttc 
20 taccttgaag tagcacagtc cgtccttgca 
tacatggagt aaaaatcata ttaagcatca 
ttcctttccc ttctcccacc ttctactggg 

25 

gtgcaacata tggtatttgt taaatacgtt 
attagaagca atactttcat g'tgctaccta 
30 agactaagcg ttgatgtcaa aaatacagta 
gacaaccctg ggttttcatc ttgctatccc 
gaagatgcca caaaatttct ctgtgcttca 

35 

tgtctggccc atacacctat gataattact 
ctgactggta catatggagc cacgaaagtt 
40 aatagatatg tattagatat atatatcaat 
gtatctgtaa tgaggcagac attgttttaa 
atatcttata tatataagat aatagtggtg 

45 

taggttaaaa cttttatggg acaggacata 
atatgttgtt agaagctaac tggaagaatt 
50 ggtgagagat gagtcattag aatgtaccca 
ttcattatga taagataaaa attattgaat 
tatatagtat ttctagattt ctcattctgc 

55 

ttctccatgc acgcttttca tgggaatgta 
tactttctcc agctccaggg atggataagg 
60 ttcttggttg aaagttggaa ggtaattagc 
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agaaatctat 


gctgactgtg 


acacaagagc 


43800 


cgatactgat 


agacttgttg 


atattattat 


43860 


tatagagaga 


taataaattc 


aaaattatgt 


43920 


catttcactg 


gctctttatc 


gagagtacta 


43980 


gaggcaacag 


cacttgaaag 


tttttcattc 


44040 


ttggcaaata 


aggtttggaa 


gcagaagagc 


44100 


catgcaaaac 


acatacttcc 


ctcccattta 


44160 


accaaagtgc 


tttttccttc 


aggaagtgga 


44220 


cttctcctgc 


ttttctttcc 


ccatagaaag 


44280 


tgtgcacgag 


ctatcatttg 


agtaaaagta 


44340 


gattcaactt 


atattttcta 


tttcatcttc 


44400 


cataattata 


tcttaatcat 


atatggaaat 


44460 


tgtttttatt 


gcagagcaaa 


aataaatcaa 


44520 


cttccttttc 


agatgcaata 


tggtggaaag 


44580 


tggtggtaaa 


taatgtgggt 


tttctaaaca 


44640 


atctttgttg 


tttactactt 


atgtagctat 


44700 


tatttctttt 


taggaaggta 


gtaataatat 


44760 


tggtataaca 


aaggtagaaa 


aaccaggact 


44820 


ttaattaata 


tatatacatg 


tatatatgta 


44880 


ttatcatttt 


attcagcata 


tatctgtaaa 


44940 


gcatataaga 


aaatagtggt 


aagagagata 


45000 


aataagagag 


agaacatcct 


tcctctcttg 


45060 


gattaacata 


taaacaatca 


aaaccttttt 


45120 


aatgtgctag 


agagtggctg 


ctgtagtggt 


45180 


ggaaagtttt 


ttttttgagg 


aagtgatatt 


45240 


ttgcacggat 


attcgtattt 


tccaattcag 


45300 


cctacactca 


catatctatt 


gtatccacct 


45360 


agtccctcct 


tctatcagtg 


ctaacctgtc 


45420 


tgtggtaaaa 


gtcagttaag 


acattctatc 


45480 


aagaaccacc 


aaatagaact 


taatactatt 


45540 
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gcttttcttt cttttttttt ttgaattttt 
tctgtcgccc aggctggagt gcagtggcgc 
5 cggttcacgc cattctcctg cctcagcctc 
gccaccgcgc ccggctaatt tttcatattt 
• caggatggtg tctcgatctc ctgacctcgt 

10 

gggattacag gcgtgagccc ccgcgcccgg 
gagatattgg tctatgtatg tacctaattc 
15 atcctacttg ttttatttct tcaatattgt 
tttattcccc atttgttctg ataatgatgg 
taactcttaa tttatgtaca agggtggagc 

20 

tgtgtcctac atctgggaag tggcagaatt 
gtagccttgt gacctaggaa aaaacaattg 
25 ttaatctcat cattttcact tcactgacca 
tgttgggttt ggagctgtga ggacgtaagt 
gatgctcaca gacaaaagca gaactgagaa 

30 

attcttttga gtctccaagt caagttttac 
atatataaag acaaggaaag agatttattc 
35 aacggcttcc tctctaattc agaataaaag 
tacatcacgt gattctttgt tgtttcttgg 
tatattgctt tgagattgtt taaaatggta 

40 

gcatacataa tttttttaac acttacataa 
aaaaagttca tgcaaatatt ttgtgctatt 
45 acaagactac ctgcacttgt ttcatgtcat 
ttcttggttt tatctatatg tgattagaaa 
tccttttgct acttttacaa taaaaataat 

50 

tgagagaggg tctggctctg ttgcacaggc 
cttaagccat cctcccacct cagcctccta 
55 cacccagcta ctttttctat tttttgtaga 
ttccctgtct gcttaatgat tgcaagaggc 
ttgccttagt agtggttgat acggttggga 

60 

gggaagcatc ttgggaccgc acttcgcctt 
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ttttcttttt tttttgagac ggagtctcgc 45600 

tatgtcggct cactgcaagc tctgcctccc 45660 

ccgagtagct gggactgtat acaagcgccc 45720 

ttagtagaga cggggtttca ccgtgttagc 45780 

gatccgcccg cctcggcctc ccaaagtgct 45840 

ccaactgtta atattttacc atgtcataaa 45900 

caatctttta attattgtgt atgtactgat 45960 

attaatactt ggcagatttt gtactcggca 46020 

cctgatttcc tccaaaaaat caccattatc 46080 

tcacccaacc cccaatttca ggagttgacc 46140 

acatcaactc agctagagtg aagttctttg 46200 

tgtgaatgac tgacacattt ccaatcaaaa 46260 

agaaatggtc tccccatctt tcctctttcc 46320 

ttgctgccat cagcagcccc gtgacagaca 46380 

gttccaagtg aacgatcaga ccctaatgta 4 644 0 

ccggagttgg cccaacccct ggggttttaa 46500 

agatagtgca ataactgggc aaaatccttc 4 6560 

ccaaagctca tataaatcct acatagtcct 46620 

atttttaaaa tatcctgact tatatcctga 46680 

agagagtgcc ttctcttggt gataattagt 46740 

gcaacctaag attagtgcat ttttagttgt 46800 

agtgcaagat attatgattc caacttttag 46860 

gtgatttgaa atttattctt catttggtta 46920 

gcagtaggaa aatgtccatg agactttgtc 46980 

tattgtttca taggtttttg tttttttttt 47040 

tggagtgcat tagaacctct gcctcctgga 47100 

agtaaatggg actacaggtg tgcaccacca 47160 

gacagtgttt caccatgttg cccagtttga 47220 

tcccctgctg ggttgaagtt gggccaagaa 47280 

tggtatggtt aaagttgaac tcatcattga 47340 

gtaaataatt ggaaaaaaaa agttataagg 47400 
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tcttgagtat 


ataaagtgat 


attattgata 


5 


aggtagatgc 


aagatatctg 


tgactaaaaa 


ttttgggttt 


tttttttgtc 


ttagactcac 




aaatttgtgt 


tatcaaatat 


ttctgtcccc 


10 


taccagtcct 


aaatatcaaa 


gattataaga 




ttgaggcttg 


tgtatttttt 


cagtcaatga 


15 


tggtaagtgt 


actataatca 


gcattgctac 




tccaactatt 


aaaagtaatt 


tatatttata 




gagaagtctc 


aaagggctca 


ttgttataac 


20 


tgtaaaatga 


taatactgta 


tgcaaagtta 




gttacactta 


cattactcag 


taaattggat 


25 


catctcgggt 


gttcccactg 


accattagaa 


taaaatttgc 


cattttccac 


aactcttact 




cttgcaatac 


tctatggaaa 


ggatttcaag 


30 


tgaattaaac 


cagaaggaag 


gaaataaaga 




aatagatgca 


ttttatctta 


atacttgact 


35 


tattatgaat 


cttcatcatt 


aaaatgtatg 




agcatatgtc 


tcatattatt 


tttcttcatt 




tatttcatgt 


acattttgga 


attacttcgt 


40 


ggattttgat 


taagaaggta 


ttgagtctat 




aatattgagt 


tttctaatcc 


attaacagaa 


45 


ttatttattt 


atattttttt 


gagatggagt 




ggtgtaatct 


tggcttacta 


caacccctac 




cagcctccag 


gtacctggga 


ttacaggcac 


50 


tcttttagta 


gagatggggt 


ttcaccatgt 




cacagcttag 


tgttagaaaa 


caaatttaga 


55 


ctctgcatag 


ctatcccctt 


tatggttctc 


tcaaattctg 


gtttttgtct 


tctgaatttg 




catggtctgg 


aaagtttctt 


catgcagaaa 


60 


ttcatttttt 


cagtgttgaa 


agtccggcac 



acttctaact ataagtagtc caagctcttc 47460 

tgattttttt aaaaaatttg aatccagtat 47520 

tgaggcaaaa tgaaggcttc acgctaagtt 47580 

tttctcctct aaaatctaat ttctcttctc 47640 

aaagtaagga taattaaaga ttaaagaata 47700 

taaagaaacg ggcgatcaat aataagcaac 47760 

aaaaactgct atctattgag aaaacatgca 47820 

aaatcaaatt aaacccactc gtaaatgtaa 47880 

agttataata ggtattttta catttgaggt 47940 

gaggcaaatc aacagttcta acagtgatgt 48000 

tatagagcat ggaacaattt ggggaagaat 4 8060 

atctgtagtt tatcttaaat taactatttt 48120 

tttcctgatt ctgagtcctg atcccataaa 48180 

agagacatac ttttatcttt aataaaaaca 4 8240 

gttgaagccc actgctgtgg ctgctgaagt 4 83 00 

agaaatttcc tgtatcttca ttttgtatct. 48360 

atagctttat ggtaggtctt gaaatcaggt 48420 

aacaaaattg atttagtcaa agtctttaat 48480 

caactttatg caatccaaaa agaccttctt 48540 

aggtctataa gggaaaattg ccaacttaac 48600 

tatacctttt cctttattta tttattattt 48660 

ctcgctctgt cacccaggct gtagcgcaat 4 8720 

acctcctggg ttcaagtgat tctctcacct 48780 

acgccaccac gctaggctaa ttttttgcat 4 8840 

taaccaggct agtcttgaac tccgtaaatg 48900 

atacttccct gcagatttct ggacctctgt 48960 

ttctcactaa ttatagctgt ctcactgccc 49020 

gcaatatcag aatggatccc tctccctgac 49080 

accagagcaa ttattatatt cacatttgtt 4 9140 

tgtttcccaa tgtctggaag cagttgtgct 49200 
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taattttttt ccggttttat ttatgatata 
acacccaggt ttgagtgcaa tggtgcaatc 
5 tcaagggatt ctccaacttc agcctcccaa 
gaccagcgca gttttctaac tgaccagagt 
gaagagctga gtttcaactt taaaaagatg 
gtaattccag cactttggga agtcgagggg 
ccagcctgac caacatggtg aaacctcgtc 
15 atggtggtgc gcgcctgtaa tcccagctac 
cccgggagga ggaggttgta gcgagccaag 
agagtgagac tccatctcaa aaaaaaagct 

20 

atttgtttat atttaaaatc gatttaagca 
acttataatt taaaatattc aagcagaaaa 
25 ttttagtcac cttaattggg tttcataaga 
aattattttt tatttaaatt aatttgtgtt 
gacagtttta atatacactc aaaagttttg 

30 

atgagacaaa gctatacaaa ttccagggaa 
tgtctctcag tttgaatttt gagtttactg 
35 acttcctttt gattgcctgg tggatagaaa 
ttgaacagag ctgatgatgc atttgttttc 
gtttttttac tgactaaggt ttgttcagat 

40 

aagtactttc gagtaactgt gtgaggctga 
aataagtaca ctctttccag cagaggaagc 
45 tggatctttc tatcatcaat ttcagcaaaa 
gaccgcttgt tttaatcctt ctcagactcc 
cttgctccca tctcttgaat gcatgctttc 

50 

ggaggagagt cacagtctgg atggggctca 
tcatctaggg gggttctttt ccccttaagc 
55 tactttcagt aactgattct tttgtcttgt 
ttagagaata ctgaaagcat tcccgtttta 
ccataacagg gttgctaaat ggcctaaatg 

60 

acatgtgtat ggaatatata tgtttgtata 
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aaaaaatttt tgagacaagc tctaactcta 4 9260 

atagctcact gcagcctcaa aatactgagc 49320 

agtgctggga tcaaaggtat gggccactgt 4 9380 

ggaagattaa gtctggttct atcataattg 49440 

atcataggct gggcatggtg gctcacgcct 49500 

ggcggatcac ttcaggtcag gagttcaaga 49560 

tctactaaaa aatacaaaac ttagttgagc 4 9620 

ttgggaggct tggcaggaga attgcttgaa 49680 

attatgccac tgcattccag cctgggccac 49740 

catgaacaca gatgttagca tttttatttc 49800 

tgtcaacata tgtaacttaa taaaattaat 4 9860 

ttgaagtatt gtccttttca tatgctgata 4 9920 

agcaactatt aaaagacact agcatgttgc 4 9980 

attttgcaat gttttatata gtaatatcca 50040 

ttttcacaca acataaatat aggaaacaca 50100 

aatatttaaa tctttgaatt tattaaaata 50160 

taggaaaaca cagaaatgat gtgtaaagta 50220 

aacaaaaaac aactaaattt gagatatggt 50280 

ctgtttgaat ctcctcaaga tcagcaattt 50340 

tgctactgct catatgattc ccaggaactt 50400 

taacactaga tttaaaaaaa atttgtcttc 50460 

agttcagaaa gttctttgtg gtttcccctc 50520 

ggtggtattt ttcttttaat aaaaggtcaa 50580 

aggtacagct taaccagata tagcctgtaa 50640 

agcttggcca ggaggggctg gacatcaaaa 50700 

ggaacctcac agtgctgtat aagccagcta 50760 

cagcagcaaa atgcttcaac ccttgtcaat 50820 

tttttttcta cttagtattc tctcccatat 50880 

cctctaactc tgaagccaca gaaactgtgc 50940 

acatatatgc aaatatatat acacaaacac 51000 

ttatttgaaa atatatttaa ataacttgca 51060 
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tattacatat aaatttatat atatttcata tccatatatg catatacttt ttctcataca 51120 

ttataaaact ctggcaacat ctacatgtaa catatgagtc tttatttgtt ttgctgacaa 51180 

5 

agattatctt tcccagaatc tggattagaa aatttagtta tttagtattt ggttgttatt 51240 

atttcaatat tttgatttaa tttattttgg aacagaaatt agcctgcaaa ttctgcaaat 51300 

10 ttttcttaga aatgtatcat atgcataaat ttgttttagc tgacctagaa aacggagggc 51360 

ttatatttcc ctttttattt ttctatataa aagatatttg atttaagaag aaaatcagtt 51420 

cgaagtcttt gtaaatctga tatattctct tttcccaatt gtattgccca agtcagagtc 51480 

15 

tattattttc cacaggcaga tatttgaaag gacttaacaa ataccacaga aggtactcac 51540 

ataaggcttt tatctaaagg cagagagtac aatgcagcaa gagagagagg gcaagcactg 51600 

20 gagacattcc aagcagtaca agacattcca gatagtacaa gctccccata gacctaacca 51660 

gacacatgaa ccatgccagt ttctgacttg aactgcccag atctatgcaa gaaactctct 51720 

atatcccatt aagtagtagt caagcttgtc aaactcatta ggctagttgc aagtcattag 51780 

25 

caaagaaact gaccttgggg cttgccaaga gagtttaaaa ttattctaaa ctgcatatta 51840 

taaattctaa tggtcttatc ttgaccaaga gccaaaattc atacattcag gcatccccag 51900 

30 gattaagaca gagctcttaa gtacctctgt aannnnnnnn nnnnnnnnnn nnnnnnnnn 51959 

<210> 77 

35 <211> 84248 

<212> DNA 

<213> homo sapiens 

40 

<220> 

45 < 2 2 1 > MS4A4 E_ini t i a l_coding_region 

<222> (18861) . . (19004) 

50 

<220> 

<221> MS4A6A_initial_coding_region 

55 <222> (67182) .. (67328) 



<220> 

60 
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<221> MS4A6A_coding_region 

<222> (68944) . . (69078) 

<220> 

<2 2 1 > MS4A6A_coding_region 

<222> (70593) . . (70649) 

<220> 

<221> MS4A6A_coding_region 

<222> (73298) . . (73507) 

<220> 

<221> MS4A6A_coding_region 

<222> (75780) . . (75881) 

<220> 

<221> MS4A6A_coding_region 

<222> (76656) . . (76748) 

<220> 

<221> MS4A4E_coding_region 

<222> (42168) . . (42236) 

<220> 

<221> MS4A4E_coding_region 

<222> (40220) . . (40321) 

<220> 

< 2 2 1 > MS4A4 E_coding_region 

<222> (35782) . . (35940) 
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10 



<220> 

< 2 2 1 > MS4A4 E_coding_region 

<222> (34287) . . (34343) 

<220> 



< 2 2 1 > MS4A4 E_coding_region 
15 <222> (23822) . . (23950) 



20 



<220> 

<221> genomic_DNA 

<222> (1) . . (84248) 

25 <223> nisan undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 



30 



35 



40 



45 



50 



55 



60 



./i Art. »7«7 

<4 uu> / / 

nggataatat 


taggctaaat 


acctaatgta 


gatgacaggc 


tgataggtgc 


agcaaaccat 


60 


catggcacgt 


gtatacctat 


gtaacatacc 


tgcacttctt 


cacatgtatc 


ccagaactta 


120 


aagtataaaa 


aaattaagaa 


atttttaaat 


aagaaacctg 


aaatgtagat 


attctatagg 


180 


aaaagtgaca 


tggtttcttt 


aatagataaa 


tgacatgaat 


aaaaaagagt 


gggaagggga 


240 


aaatatagat 


tgaaggagac 


ttaattaaca 


tatcaacaaa 


atgcaatgtc 


tggcctttgg 


300 


atggatgggg 


ctggggtgtt 


cttgacttaa 


gaaaatgaac 


aaatatgtat 


gagaaaataa 


360 


taaaatttga 


atatgaactc 


tatattcgat 


gatactaagg 


aattattgat 


aatgctgtaa 


420 


aatgtgataa 


tgatattaga 


aaatgttatg 


gaataagtgt 


catctgttaa 


tgatccatac 


480 


caaggtataa 


actagtgagg 


tgatattaaa 


tgtgagattt 


actttaaaat 


actcttacaa 


540 


atatgtgcag 


ggtggagaga 


gatgagaaaa 


cattggcaaa 


ctatcgataa 


ttgttgaagc 


600 


agactggtgg 


atgtatggta 


attcattata 


ctaacctccc 


tacttttgtg 


tgtgttttaa 


660 


attttccaga 


agaaaaagta 


aaagacagaa 


agtggtgatg 


gtttcatgag 


tgtatattta 


720 


tctaccaaat 


gatcaagttg 


tatacattaa 


atatatacag 


ctttttgtat 


gccaattata 


780 


cctcaataaa 


gtggattttt 


aagacagata 


agaaataaaa 


ataagataga 


taaaaactac 


840 


acgtgattgt 


gaaattaatg 


aaatgagtgt 


ttttagctcc 


atcagttcag 


tttggttctt 


900 


tcttaaaata 


gcaatttcat 


gttttatcta 


ctgtactatt 


ttattgtatt 


ccttagattc 


960 


attggattag 


gtgtagactt 


tctgctgttg 


atgatctttg 


ttcctatcca 


tgttctgaat 


1020 
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tctacgtctg acatttcagc ctttcagcct 
agtaatttgg agataagaag agactctagc 

5 

cagtcatctg tttgtgctga tgttccttca 
tttttttttt cttttattgc ctttgatgcc 
10 tttagttgat tggctttgat tctagtcagt 
gtattggatg aggcttccct gattactgtc 
ctggtccaca gaactccctt aggcaggggc 

15 

gcttagcgct aatctgctgt ctgaatgctt 
tggagtttag gtagaagcag gaccactggg 
20 caaagattgg tggagttgcc caccctatca 
cacttcttgg taatttcagg cagaaagaag 
tgtggctcac ctgtctacca gtggtaggag 

25 

tgcttcccag ggaaacagga ggctgcacct 
ctaggccaga actctagcag gcattgccca 
30 tcacctgccc tgctgtcagg gtgttttctg 
cctaaagact gcaatttctg gctgagttca 
ctagcaagca ttgcctgccc ggctaccagt 

35 

tgggtgcttt ctgggacaag agtaggctat 
aatgctgggt aggaaggtgg cacaaggaat 
40 acagtcccta atcaagagag gcaagagcta 
gactgcaagt gtcttttctt ggtagaaggc 
tgtctggtag tttgtcattg ataccttctg 

45 

ttctaatcct acacaagcat tttttaactc 
atcatcttat ttttatcaag aatattgctt 
50 gtattctgga tatatatcct ttcttaatta 
gggttgtcaa tttttgctat gtatatttag 
aaatgattac atctttctga agattgtact 

55 

tcatcagtaa catattttgc tctaaggtca 
ggtttgctcg atatatcatt tctcatcctt 
60 ttacttcttt ttaacagtaa ttagttcagt 
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ggttaaaaac 


cattgccgga 


gaactagcgg 


1080 


tttttgagtt 


gccagttctt 


gcacagctct 


1140 


atctttgaag 


ctgctatctt 


ttagatgggc 


1200 


cttgggggtt 


tgattgtggt 


ataagttggg 


1260 


agactggctt 


tgatcctagt 


cagttcctgg 


1320 


tctctgctca 


catttattat 


gttgggcatt 


1380 


cacagttggc 


ggacaagtag 


tatccttaca 


1440 


tccagggtaa 


cacagtgttg 


tccccaccta 


1500 


ctataagctg 


gggtgtctgg 


ttacaaaagg 


1560 


tctgggtgtt 


tctggggcaa 


gagaatgctg 


1620 


tagcactgct 


gagctggaag 


ctccagcaga 


1680 


tgggtagggc 


tgtcctctct 


ggcatctgga 


1740 


gctggctcag 


ttcacagaga 


agtaggacca 


1800 


ccaggctacc 


agtggcaggg 


ttgggttgca 


1860 


tgattacagg 


aggctgtgcc 


ctctggcaca 


1920 


catataagaa 


ggactactgg 


tctggaagct 


1980 


gtcgggggtg 


cattgcttgt 


ccttcttgtc 


2040 


ggctactggc 


caagtccagg 


cagaagtgga 


2100 


attatgtgga 


gattatttca 


gaaccctaca 


2160 


aaggaatcag 


gaaccctagg 


agggctatgg 


2220 


ccaacagtta 


gctcaattgt 


ctgtgttaaa 


2280 


ttacataccc 


tctatgctcc 


atcaatcgta 


2340 


tgtatttctc 


ccacctcctc 


aagccaagcc 


2400 


acagcctctg 


aatgttccct 


ggttttttat 


2460 


catgtattaa 


aatattatct 


aatattctgt 


2520 


aggttctgtt 


attaaatata 


tacacactta 


2580 


atttttataa 


ttaaaaatta 


atccctttgc 


2640 


attttctctg 


atcagttttt 


tttgggttaa 


2700 


gtactttaca 


gtttttgtat 


gcattaaaat 


2760 


tttgaccttt 


taaaatttta 


tctgaaatat 


2820 
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ttgtatttta ttacatatga cgtatttatt 
ctattgactt tcctggagga aataaatgtt 
5 tacacatgct ttaattttta ggattacaag 
aaattaatta gagggagaaa taatgcaata 
atagaaaaca tttttcgttt gttcatttgt 

10 

attgattttc ttaaatattc agtgttcatt 
tgtcagggaa tgtaggtctt gcttaccata 
15 gggcataaag ccaggttata catacagatt 
ccttctccag actaaggtga cctctgtgtc 
gagcgcagga ctttcatttg ggaatcaatt 

20 

ttataccttg tgaatggaag cttagacctt 
gagctggcac tcacattcta tatacttgtc 
25 ttggtttttt gctcataaaa cagttctcat 
ggtgatgcct gtgtgaggga cagaggaaca 
aaattaatta acaactgctc tgggctctga 

30 

aacttgagct cttttttatt tgcttttatt 
tttagaggta gggtgattct atttatgttt 
35 atatcaccat aaatcctcaa tcacttgtct 
cacactatat ggaagaggag gtaagattga 
tcttataaag tcatgaaact ctgtaccttt 

40 

gcctgttatc ttctttaatc tctttttgcc 
ctgtttctat tctctagcaa aaagggtctg 
45 taacctcatc cctctgggtg aagagctgtt 
cctagctcct gccagttatc caacttttgt 
aaattaaata tgctccaagt cttactcttt 

50 

attgccttac tgaatggctc tatcattcat 
tcctttcctg ttttctctcc tgatttacac 
55 tcgttcctaa acatctttca aatccattca 
gttcagtaac catcatccct tccctagaga 
tttcagtatt atttattcct attgttgctt 

60 

tataacccat ttaattctga gaattaaaat 
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tatatatttg ggattaaatc taccatcaca 2880 

ctaaatgtta ttagaatttg ataaatcaaa 2940 

aacacacaat gaaagacttt ataagtacta 3000 

aaggggtata tagaaccttc aatcaaaaaa 3 060 

ttgttgtatt tggtttctct gttttacgat 3120 

tacactttca tttggtaatt tctcctatca 3180 

gcttacatcg ggtgactgtg agagaagagg 3240 

ggttgtgatg tagttgagag aaagcccctt 3300 

ttcatcatca ggatcccagc tctaaagagt 3360 

ccataatatt ctctgctttt tgcttttcct 3420 

aattcttgtc tctttcctgt tttttactca 3480 

tagagcagaa ccctaaaaca gttttttggt 3540 

attttggttt ggggataaac taccttgtta 3600 

actgaccccg cttttccaat ataattttaa 3660 

cctactctgc tgcttgttgt tttgacactg 3720 

tgtatagtag ttagttttgt cctatgctta 3780 

ctctgtcaaa tttcaatttt attgttcttt 3840 

gctaagaata acactaataa tatctcactt 3900 

catctgttct tagtctagaa tttcaacctc 3 960 

gtcttctgta atgccacatt ttcctagttt 4020 

atcctctgga aacccttatt tctgataata 4080 

attttttctg cctcctccca caagcctggg 4140 

gggtttacct ctacagccat gttctcattc 4200 

gaaacttatt tgaaggtgcc cacacaattc 4260 

gttcctgcag ttctgtcttg gtccatgtct 4320 

tcagtttcct aacccagaat ctgagcgcca 4380 

atccattata tcaacaagta ctgtcatttc 4440 

ttctctccaa cttcactagc atcataggta 4500 

actgcctctt tcttaaccct aactccctga 4560 

tacatgaaaa ccaggacaat atttctgaaa 4 620 

aatttcagat tatagacaac aaagtttttg 4680 
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ggattaggca aaactggatt tcaatctcat 
tcagttccat cttctggaaa atgggaataa 

5 

ttgtgtgtct tttgcttatt catcaggtct 
tcctctgcct acccaacgtt gagttctttg 
10 attttacctt gcagatcact gattataatg 
tgccacttta ccttgcaggt aactgattat 
tcaagtagcc tataaaatcc ttgagcgaaa 

15 

tggcacactg cttggcttat agtagctttt 
gcatgccact ttgctttcct ccatgtaaag 
20 ggttcagggg attaacaagg tcatacaaga 
agaccagctt gttctatgat accagctgct 
attgctgtac aaccaacttg ttcccatttt 

25 

aaatatcatg aaaataagat atttaagttg 
aacctaatac aatagtaaca gtagttctct 
30 attgtgaaag ggggcattta taaattacca 
agcagagtct ggcaaacaga aaatcctcca 
attttcatag gaaagactgt atcaggcagc 

35 

cgtccacacc catattggga acgcaaaatc 
agttgtgtgt ctcagaagtt ctcaacacat 
40 ggagtgcaga tttaaactga gtgaggtgtg 
aagaactttg aggaacttgc ccagagccct 
ggcctgaaga aaggtaggtc cagggacttg 

45 

ttcatttcct gggtttagga tttctgggag 
ccatgaccca agctggtagt cagggttggg 
50 tggtgagggt tgtgggtggt tgaggaacag 
gcctcctaaa ggctgacact ggaggctgga 
agttctcagg ataccaaaac actgtaagca 

55 

tactgttaat gcacttcttc catggtcttt 
attcaagact taagaaaaga tctagcacgt 
60 aggcagggtc cccagccttg tacatttaga 



ctataccagc 


tatatgaata 


tctctgagtt 


4740 


taatgttacc 


tagctccatc 


cttcttccat 


4800 


ccaattagat 


gtccctttat 


ctaggaagtc 


4860 


atcctcctat 


gagctctctc 


aggatatcca 


4920 


ggttgtaatg 


atttatttat 


tcatcagcat 


4980 


aatgggtgat 


ttatttactt 


gtctatattg 


5040 


atcatttttc 


accattttat 


ctgaagcaca 


5100 


ctctaagaaa 


atgctgagtg 


aatgaaaatt 


5160 


aaggcattat 


tttattttac 


caatgaatca 


5220 


aatgattgcc 


attgggtttg 


tcttagtcaa 


5280 


tctttgactt 


agaaaggtgt 


tgtattaatt 


5340 


atggctattc 


atgttgaacc 


tccaaaggtg 


5400 


tgtatattaa 


ggcacaatta 


ctttttcacc 


5460 


gggaagaaga 


gtactgtgga 


aatgtgtgaa 


5520 


tttacttagt 


tgtgtatata 


acaatctttg 


5580 


ccactgatac 


ccacagtaat 


tagcactatt 


5640 


cactgctttt 


tcagcaaatt 


tcaaaataaa 


5700 


agttaatgct 


tccgataaaa 


aagaggaact 


5760 


cctttaaggt 


tccaaacatc 


tactagaaaa 


5820 


gagtggggga 


agctgactgt 


gtctggaaca 


5880 


gcactcatca 


gacctgcagc 


agacattgca 


5940 


ccttcctagg 


ctctcggact 


gatagtttgt 


6000 


tggagtgaga 


aaaaaaaatg 


aggtcacttc 


6060 


gtggattgag 


tgcacattta 


aatatgcatt 


6120 


tcttcgctac 


agttgatggg 


agttagaaaa 


6180 


tgaagttgtt 


ttagttactt 


ttgttccgta 


6240 


acattttaaa 


ctttgattaa 


ttacctcatc 


6300 


attctttgtc 


tctgactggt 


tatttttata 


6360 


gcagaatttg 


gtgtacatgg 


agtagggcac 


6420 


tgggctgtgg 


cctgacgtag 


ttctccagtg 


6480 
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gtctttggca ctattcatgt ggcacccagg 

tgcaaatgaa aaacatcttg tagactttta 

5 gtagtcgtag tcattgtaat ctttccccag 

taatgtacct agaatcacac aagttaacaa 

tggttcctag cccaggcctt tccccctttt 

10 

atcatcctcc taaaaaccat ccagattcag 

aactctgcca agaaaaaggg gatgaccgtg 

15 gtgacccata agggggatgg tgacataaaa 

gatccacaag gagactgggg ccaatagaag 

tttatccggg gtggggagga ggggcgggga 

20 

tataggtgat gtggcaaaac caaggtttag 

ccctcatatg tatccaatat tctctgcagt 

25 tggagtcttg taaacacacg ctttctttcc 

cacatggatt tctgtggtgt aaagatagga 

cttataaaga cagagcacca gccttgaatg 

30 

tatggaaaat cataagagtc aattataagc 

gaagcttaca gccttacccg agaggcacta 

35 gataagacag acccaacatg tcctaaattc 

gccgagtttc ccttacatta aagaataggc 

gcactttggg aggcccaggt gggtggatca 

40 ' ' 

aacacggtga aatcccttct ctactaaaaa 
gcgccggtag ccccagctac tctggaggct 
45 cggagcttgc agtgagccga gattgcacca 
ctccgcctca aaataaataa ataagtaaat 
tagatggact aagaagaatt atcttcagtt 

50 

taaaaggaaa ataacataga tttcttaaat 
acatgcggga aatgggggct gtaggagaat 
55 aactgttagt gtgtgagaag attagatgct 
ggtcatggcc gggggagtgg aacattcctg 
atataggaat gaaattactt tagaacaact 

60 

aaaataaaaa ggagaaattc aaggtaaggc 
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cagtaaactg aggctgctca tgataaaatc 6540 

gcattcttag gataagaaag aagcttgtaa 6600 

atgaagtaaa tgaaatccag aaaagggaaa 6660 

aagggctagg agcaaaaaac ccagaattct 6720 

cccttcccat gccaccattt ctgcatcccc 6780 

gcctcatttt gggtttctag tctatttgat 6840 

tataccactc tcctcatgct ctcttgtgtt 6900 

atgaggaaac aagagactgc tggtgctgct 6960 

agggaagggt ggtctaggtt accccacctc 702 0 

agacattgta gttgtcaaga acctgatgca 7080 

attcttctcc aacctgaaag aaatgtccct 7140 

taaaaaatat atattttgat tgtagagaag 7200 

ttggggacct tatctaccct agataattaa 7260 

ggcagttata taaatttatg tcaagaatgt 7320 

accagaacct gacctatgtt accaaagaat 73 80 

cttaatttga tataatggct cggccctcag 744 0 

aacacagagg gagaactacc aagtgtgttt 7500 

tcagtctgaa ggtgtgcgtg agagattgag 7560 

cgggtgcggt ggctcacgcc tgtaattcca 7 620 

cgaggtcaga ggatcgagac catcctggct 7 680 

tacaaaaaaa ttagccaggc gtggtggcag 774 0 

gaggcaggag aatggcgtga acccgggagg 7800 

ctgcacgcca gcctgggcaa cagagcgaga 7860 

aaataaataa ataaataaaa agtacacatt 7920 

tctttcaaaa taactttggt ttctgtggtg 7980 

ccttagttac cactagaggt tgagtaatcc 8040 

ggtgtctgag agaagacaga aactatgcaa 8100 

9tggtgggtt tgttgctatt cttgtaaggt 8160 

caaacgggct caacatatgc aaatgtctca 8220 

taaataagtc caatatactt ggggctttaa 8280 

tacgataata gatatgggtt gaagtagcag 8340 



-151- 



WO 02/062946 



PCT7US01/48437 



caatatcaac aacattaaat ttttatcttt 
actttaaagg ggagggtgac acagttgtct 
99Qtgggttg aacatccaag gctacaaatg 
aatactcctg ataaaaatag gatgatggct 
10 taaaaaatgg ttgcgtagaa ttagaatatg 
aggtatcaat gacaaactac cagacattta 
tctaccaaga aaggacattt gcagtcgcat 

15 

tcttgcctct cttgattagg gactgcttgt 
ccttgtgcca ccttccttta tctgaaagat 
20 tttggtcttc taatgtttga aataaaaatt 
aatgatcaaa ttatgggctt tattatagtg 
tctctcaagt tcagcaaaat ttgagggtat 

25 

gctattttat tttattcgtt tgctccaaat 
gaagtctact tataggtttg attataccta 
30 taaactccat tatcattaga acagagatta 
ctctacatag tcttgtatct acctagtcca 
ttctgaccaa tctttgagac aaactgcaaa 

35 

gtaccatgaa taacatgctc tcaccagtta 
cccccaagac aggacaggtt ggagcagggt 
40 gtccttgagt ggtaggaagt gttagaggaa 
agtaagtgat agtaggaaga ccaataataa 
gacatgaact ggttgaaatg tttacgtgtg 

45 

atgctgtatt attcacaggg tagagttaca 
agtggcaggg ccaaaattta aatccgggta 
50 gatccactac actagtctgg cgaatagaag 
tcagttgttg gacatttggg ttgtttgcac 
aaaattcatt ttacaagttt ttgtgtggac 

55 

gggaatggaa gggataggtc gtaaggctgg 
tagggaggaa tcctccttat gttttaggaa 
60 tattagctct tctgtggtat aaacacttct 



atactaaagg 


tggtgaaatg 


caaagaaagg 


8400 


ttgcatttga 


tcatttggct 


acaaagtcag 


8460 


gacaccgggg 


aacattcaga 


ggctgtaagc 


8520 


tggcacgcgg 


agatgggaga 


aatacagagt 


8580 


attgatggag 


catagaaggt 


atgtaacgga 


8640 


acacagacaa 


ctgagctaat 


tgttgggcct 


8700 


agccctctta 


gcattcctgt 


tacctttagc 


8760 


agtgttctga 


agtaatctcc 


acataatatt 


8820 


aaattttgag 


gaagactgaa 


tatcttctaa 


8880 


acctcttctc 


tttgctggaa 


caatttgata 


8940 


acttcaaaat 


taaccccaga 


gggggacaat 


9000 


ggattcagtt 


taagaccaaa 


aataaggcct 


9060 


ctaaattttg 


tattctgttc 


aggggatgct 


9120 


ctcaataaaa 


tgtaaatgta 


agtcacctca 


9180 


caagaggaag 


aggagggcct 


cttggcccac 


9240 


tgagagcaaa 


aacaagcaca 


accctgactc 


9300 


agagaggaag 


agtcaacctc 


tgacaacacg 


9360 


tatagacccc 


tcctttctgc 


tttttcactt 


9420 


catgccagga 


ataaagcccc 


tctttagcat 


9480 


atttatgttt 


tatacttcaa 


gccagaaatg 


9540 


tagtcaatag 


gtagttagta 


tttattaagt 


9600 


ctatttcatc 


attaaatttc 


tctacgagat 


9660 


gaaacacaag 


attaattaac 


ttgagtaagg 


9720 


atttaattcc 


agaactcgtt 


tgcttaagag 


9780 


aactctttta 


tttgcacatt 


tattcattcc 


9840 


cttttgacta 


ttataggtaa 


tactgctagg 


9900 


atatgttttt 


atttctcttt 


gtatgtatct 


9960 


ttcatggttg 


acattttgca 


aagaatcagt 


10020 


tagtttcagt 


aggattagct 


ctttggtatt 


10080 


ttgtacatgt 


gttagaattt 


ggatgtgaat 


10140 
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ctatagggct cagtaatcta atcttccacc 
aatggaaaca acttgtagaa aagaaataat 
5 aaacacatta gagtaaatca acaaaaccca 
gaaaaacctt tagctatatt gaatgacaaa 
atgaaagagg cagcataatt accaaactta 

10 

aaaattatat ccaaataaat cagataacat 
actaaaactg actgaagaag aaatacaaaa 
15 tgtaataaga gattaaatga ggaattttaa 
gtgatttcac tggtgaattt tactaaacat 
aactagaaga gaagggaaca attccaaact 

20 

caaaatcaag aaaagacaac ataagcgaac 



30 



40 



cagacacaaa 


tgctaaaaaa 


aaattagcaa 


tatatgagga 


caaagtcaga 


ttaatctcaa 


tgttacataa 


tgcattaata 


gaataaaaga 


tcaatagatt 


aaaaaattaa 


caaaatccaa 


gacgaataga 


atggaatcac 


tcccccacca 


ttaagcaagc 


tcgaaggata 


caaataatat 


tgtacatgcg 


gccaacagtc 


atatgaaaaa 


acaagtgaaa 


accataatga 


aatgccccct 


gtcaaaaaaa 


taaaaataac 


agatactaac 


actgttggtg 


tgagtgtaaa 


ttagtttaac 


aacctaaaaa 


cagaaatacc 


attcaaccca 



45 aatagaaatc attctatttt aaagacacat 

cagtagcaaa gacatggaat caacttaaat 

tgtggtatat atacaccatg aaatgctatg 

50 

attgcaggaa catggataga gctggaggcc 

aaaaccaaat actgcatgtt ctcacttata 

55 acataaaggg gaacaacaca cactggggct 

aggagcagaa aaaataaatt ataggtacta 

cagtggctca tgcctataat cccagaactt 

60 

gtcaggagtt caggaccagc ctggccaaca 
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ttaataaact 


ggaaaaagaa 


gaacaaacta 


10200 


aaagactaga 


gtgaaaataa 


tagagagtag 


10260 


aagttcattc 


ttgcaacgat 


caacaaattt 


10320 


aattacaaga 


agaaatcact 


aaaatcagga 


10380 


cagaattaaa 


gaaattataa 


gggatactat 


10440 


atgaaatggt 


caaattttta 


tgaagatact 


10500 


ataagaagaa 


atacaaaatc 


tgaatagatc 


10560 


aaacttttat 


aaaggaaagc 


ccaggctcct 


10620 


tttaaaaaga 


attaataccc 


ttccatcaaa 


10680 


tattacataa 


ggctagtatt 


attttgacac 


10740 


aaaactacat 


atgaatgtct 


ctcatgaata 


10800 


gcagaattcg 


gcaacatgta 


aaaaagatta 


10860 


gaatataaaa 


tttagcattc 


aaaaaatcag 


10920 


caaaaaccgc 


aaaaagccac 


cacgaatgcc 


10980 


tactttttat 


gataaaaata 


ctcatccaat 


11040 


aataaaaagc 


tattagaatt 


aatgaatgag 


11100 


acacaaatga 


acagacactt 


caaaataaga 


11160 


atactcaaca 


tcactgctca 


ttagagaaat 


11220 


tacaccaatc 


agaatgacta 


ttaacaaaaa 


11280 


gaggttatgg 


agaaaaagaa 


atattaatac 


11340 


cactgtggaa 


cactgtggtg 


attcctcaaa 


11400 


tcaatcccac 


cgtgggtata 


tacccaaagg 


11460 


gcacacatat 


gttcattgct 


gcactattca 


11520 


gtccatcaat 


agtaggctgg 


ataaagaaaa 


11580 


cagccataaa 


aaaagaacaa 


gatcatgttc 


11640 


attatcctta 


gtaaactaac 


acaggaacag 


11700 


agtggagcta 


gatgatgaga 


acacatgggc 


11760 


tattagaggg 


tQQaQCfCtQQ 

33 **3j v v 33 


Qaaaaqqqaa 


11820 


ggcttaaaac 


atgggtgata 


tggttgggcg 


11880 


tggaaggctg 


aggtggatgg 


atcacctgag 


11940 


tggtgaaacc 


catctctact 


aaaaatacaa 


12000 
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aaattagccg ggcgtggtgg tgcatgcctg 
gagaattgct tgaaccctgg aggcggaggt 

5 

ccagcctggg tgacagagca agactcggtt 
gataaaataa tctgtaaatc aaaccctatg 
10 gtacccctga agttaaaata aaagttaaat 
agttctatat gctagcactg aataatttaa 
aatagtttca aatagaacaa aatttatcaa 

15 

acaaaatgtt aaaagaaatt aaagaaaatt 
aactggaagg cttaatattg gtaagatggt 
20 cacaatcttt ataaaaaatt caagcaacct 
atcatatgga aatacaaggg actcagaata 
ttggtgcact tacactttcc aattcaaaac 

25 

tggtattggc ataaggacag acatatagaa 
aaaataaaaa tgggatggaa tagaatcatc 
30 tgatttttga caaggataac acaactacta 
catgtttgaa caacttcaga ttcacatgca 
aaacaaaaca gctcaaagtg gagcaaaaac 

35 

ccttctattt aggtatattg agacctatat 
aaattttaga tatagagcaa agagataaaa 
40 atttaataca taccttccaa ttttgttaaa 
agtcattaca aaaaagccca tagtttgatc 
tttatttcaa acatttgaat acctaaatag 

45 

ttcaaaaaag taaggtggtg caagggacag 
tagtcgctaa tcaagagtgg caagatctaa 
50 actgcaagtg tcctttccct gttgaaagcc 
atctggaggt ttaagatcag tcactgtggc 
ctgaggtaag aagattgctt gaccctgtga 

55 

actctgtctc tacaaaaagc acaaaaatta 
cagctacttt ggaggctgag gtcaaggctg 
60 agcttgtgtg atagagtgag accctgtctc 
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tagtcccagg 


tattcgggag 


gctgaggcag 


12060 


tgtagtgagc 


tgagatcaca 


ccactgcact 


12120 


tcaaaattta 


aaaaaaaaaa 


aaacctggat 


12180 


acacaatacc 


tgtatgacca 


acctgcacat 


12240 


taaatatata 


tacatacaaa 


aataaattgg 


12300 


aaatgaaatt 


aagacaacaa 


tctcctttac 


12360 


aataagcaca 


agacttgtac 


atcgaatact 


12420 


taaacaaatg 


aaaaaaaact 


catattcagg 


12480 


aatattattc 


aaattgacct 


acagattcaa 


12540 


tatttcagaa 


atttgcaagc 


taattccaaa 


12600 


gtccaaacaa 


ttttgaaaga 


gaaaaatagt 


12660 


ttactacgaa 


gctataattt 


taagacagtg 


12720 


caatggaata 


gaattgaata 


aaattgtcta 


12780 


tagaaataaa 


cctctatatt 


tatgactagt 


12840 


agtgagaaaa 


gaatagtctg 


tttagcaaat 


12900 


aaagaatgaa 


gttggacact 


ttcttacacc 


12960 


caaaatgtaa 


gacctaaatg 


taactaatac 


13020 


gtagacttta 


gcatcctgtg 


agcagaacac 


13080 


taaaatagca 


ggccttattt 


ttggtcttaa 


13140 


cttgggagaa 


ttgtcctcct 


ctgggcttaa 


13200 


atttaccaac 


ttgtcccgga 


agggagacaa 


13260 


aagatatcca 


gtcttcctca 


aaatttctct 


13320 


tatgtggaga 


ttatttcaca 


accctacaag 


13380 


agggaacagg 


aaccctaaga 


aggctatgaa 


13440 


caacaattaa 


ttactttgtc 


cgtgttaaaa 


13500 


tcacacctat 


aatccaagca 


ctatgggagg 


13560 


atttgagaac 


agcctgggca 


acatggcaaa 


13620 


gctgggtgtg 


ctggtgtgca 


cctgtaatcc 


13680 


cagtgagtca 


tgataacgcc 


actgcactcc 


13740 


aaaaacttct 


ggtagtttgt 


cattaatgcc 


13800 
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ttctgttacc tacactgtat gctcaattgc 
tgacagtttt tgattagaag tttattttat 
5 tgactactat ttgcatggag tatttcatcc 
gctaaagtga gactcttata gcatatagat 
tgattgaaga atttaattcc tttatattta 

10 

gtgccatttt gttatttggg tggggggggg 
atctcttgtc tccctttgtg gttaggtact 
15 tatctttcat gtatctgctg cagttctaca 
aagggcatgt atgacataaa cctctggtga 
gtgcacaaag caaagagaga gagatggggc 

20 

tgccatagct taccctctcc catgataatg 
ctgggtgtaa ttacctctga aaggtcacac 
25 tttcacatga gttttagagg ggacagtgaa 
actcatgttc ttctcacata caaaatacaa 
tacccattct agcatcaagt ctaaagttct 

30 

tgagattcaa ggtatgactt atctggaggc 
aaaaccagtg atttgtttcc aaaatataat 
35 gttcaaaaga gagagccagg caagaaaaag 
caacagagaa aacagaatta tgttttaaat 
ctgaatcctg ggagtgctga ttggggccct 

40 

ggactcagtc cgtgcttcag ctctgttagg 
ctgagatctc aaaggtagcc ccactcccaa 
45 tctctgtgtt ggctctgact ccgcattccc 
aagtggctcc actcccatag ctatatgtga 
gcccctacat ctcttgcagt ctgtgagcct 

50 

ttgtgccttc caggcacagg tattccaggc 
tcagggtctt gcaaaactgc tgtggcatat 
55 gatgcaggtt acccacaaag ctaaggctgt 
ccatggagca gggatggaat tgagactgat 
tccagggaaa gaggggtgct ttggaggctt 

60 

tttggttcca gagacaatcg ggcatagcag 
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tcattaaatg 


atgaccttct 


ttgcctcttt 


13860 


ctgacctaag 


tacaggcagt 


ccttctcctt 


13920 


cttcactttc 


agcctgtgtt 


ttttcttaaa 


13980 


ctgttttttt 


ttttaaattc 


agccactctg 


14040 


aggtta.ttat 


taataggtaa 


gtagttagta 


14100 


ttgtttttgt 


agatatttgt 


ttatttcttc 


14160 


tttatgaagt 


actaagcttt 


taattttttt 


14220 


gttcttatag 


ttctagaggc 


tgggaagtct 


14280 


gggtgatccc 


atggaaaaag 


atggaagtga 


14340 


caaatttatt 


ctttatcagg 


agtccactcc 


14400 


acatttatcc 


attcacgagg 


atgaaggtct 


14460 


ctcttaaaac 


tgttacaatg 


gcaattaagt 


14520 


atcatagcat 


ttcatccacg 


accctccaaa 


14580 


taattctgtt 


tcaatagcct 


caaaagtttt 


14640 


aagtccagag 


tcatcagaag 


cagatatgga 


14700 


aaatgtttcc 


agctgtgagc 


ctgtgaaatc 


14760 


ggtgagatag 


gtatagagta 


gactgtctca 


14820 


gcataactgg 


tttcaagtga 


gtccaaaatc 


14880 


ctggagaata 


atttcctttg 


actccatgtc 


14940 


cagacatgac 


cctgctccca 


tatctttgct 


15000 


ttggagttgc 


atgctagtgg 


ccttatagtt 


15060 


ggctccacta 


ggcatatccc 


taattgggac 


15120 


actcagcttt 


gccctaatag 


aggctcttag 


15180 


tattctttga 


tcctaggtgg 


agaaagccat 


15240 


gtacacatgg 


attccaccaa 


ggtttacagc 


15300 


atagtggcag 


tgcgagctga 


ggtcctaggc 


15360 


ggcacaggtt 


cactcaccca 


gttgtggctg 


15420 


aactctaaag 


cacacctcag 


taactcaagc 


15480 


tctggggtca 


gtgcacaaca 


tcatcaaggc 


15540 


gagctctggg 


gagcaggtta 


ctgttgaaat 


15600 


caattccaac 


cctgggggat 


ggagtgccac 


15660 
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atagtgttaa ctctagaccc tgggatggta 
ctggatgtag cagcagcaag gcctcaggaa 

5 

ggtttcttgg agcaggtgca gaaatgaccc 
gactccaggg gacaggtcag ttccggggag 
10 gtggggtgac gtagctcaat caaagctatg 
ttgagtttgg aggggcaggg tgcaccagcg 
ggagcctggc ctcagggagt agagagtaga 

15 

gcgtgtgcat agtgtagtgg tggttaagtc 
ctcctgaagc agaatgcact ctagccagtg 
20 caactcaggc catgggggcc agggcacagt 
tgtggacttc aggtagctct gtcagctgag 
cagtagtaag actgcaggtg tctatggtgg 

25 

ccttttccct gtaggaagaa ggccctcctg 
gtgatggaag aaaggtgttt tctctccttc 
30 tacaagattt ctgctactcc tttgctgttt 
gaatgtagtt gtttattcat ttttggtgtg 
tgtgtgtgtg tgtgtgttga tggcgatgag 

35 

acattctcta gaattctgaa gacatattat 
taaactttct tctattgtga ccattttctt 
40 caattttttt ccagttgact aattctattt 
caaaattttt tctgtcaaca ttattttata 
catattgtgt cactttttgc actttaactt 

45 

taaattatcc atacttttta catattctat 
gtgggacttc ttggaggctt gtatcaagca 
50 taagacttat tcagagacta attgctttct 
tactactaac tttgatagtt ctcagaacta 
tgatgcattt aatttctagg attatgatca 

55 

cttatctttg ataagtataa tctgaaatct 
gtttctacat tttcccccaa atttatcctt 
60 cctgacttat agagatttta gttgtttccc 
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gggcatggca 


atatcccagg 


ctgtgtaagg 


15720 


tgactaggcc 


tgtctgtagc 


tttttccttg 


15780 


cctggggagt 


cagggtttct 


cagaagcttg 


15840 


gtggggcact 


gcagctattt 


tgcctggatg 


15900 


ggtgcccaga 


gagtggagca 


cagtttcaac 


15960 


actaagatgg 


gggcatggag 


cagttccatg 


16020 


tagcagcagt 


ttggctcagg 


tatggcttcc 


16080 


tcagagatgg 


aggggtgcaa 


tggctactca 


16140 


attcagttcc 


aagaaggggc 


agcacagtag 


16200 


gtcagctcct 


cctctgggat 


agaacagaca 


16260 


ctcagagcct 


gtgaagactc 


tatgtgtctc 


16320 


tgatgggagc 


tgctgggatc 


ctgtcactaa 


16380 


actcagagct 


ggttctaact 


gggggaaggg 


16440 


ttcctgcagc 


cattctgggt 


ttctgtgctc 


16500 


tctggagctc 


ttccttagtt 


attttggttg 


16560 


tgtgtgtgtg 


tgtgtgtgtg 


tgtgtgtgtg 


16620 


gttagtaact 


tctaattggc 


catcttgctg 


16680 


agacctttta 


attctaaact 


tctaatcttt 


16740 


gtttttctgt 


agaattcttg 


gtgatttctt 


16800 


tcagctattc 


cttgttattt 


aacacattgg 


16860 


tttgaaattt 


gtttgagaca 


atgttgcaaa 


16920 


ttcatttttg 


tgatttcatt 


ttcaattgaa 


16980 


atgtgatata 


tatccagatg 


ctcctaatct 


17040 


tatctacttc 


cttctttatg 


gaatccaatg 


17100 


aagcatgtct 


ttttgttggg 


atttctcctt 


17160 


taattctcca 


gggaagtgat 


aggtatcagc 


17220 


ataatttatg 


ctgtacacct 


caaggagact 


17280 


gtggccatag 


tgtcctgttt 


ctttgatact 


17340 


atgcaatctt 


tgtttttaac 


actcatttca 


17400 


actttcataa 


caaaatggag 


ttctaattct 


17460 
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gtttgtccaa attgtttttg gaacttttct 

ttggttacat tcttacattt aagtccttat 

5 tatttcttat tattaattat atttatcttt 

aatttgtaat tttgtacttt ttattttact 

taaaattaat gctctaaatg attaatcttt 

10 

ttctctgaaa ttcaccctcc ttctcccaac 

aatgtatttt cattgatcta tcagagataa 

15 ctgaattttt tctgtacata gcaaatttgg 

accaataaaa gggcacacat ctccaaacct 

atcaactatg acttgaagga tattagcaag 

20 

gccatgaaac atctgaatat caattcaaat 

cctatctcca cattacttgg ttggaatcat 

25 aatatctttg tgcttggaat ttcaaaaaat 

agttagaatt taaagcactg taataggaaa 

gactcagagc tgtaagtttc attcccagag 

30 

acatgggaag attttcagcc ctgagaattt 

atacaatact gtacaataaa ttggaaggca 

35 aaagtaatcc ttcatgggga agtggctgta 

aataaaatct gagcagactt gggatttgaa 

tctgtcaaca tgtcctctct cctagcatat 

40 

tctctctctc tccacccccc acttaatgtg 

ctattataat attggagggg aatgtagacc 

45 aaagtgtcaa gactttcttt tgtcatttcc 

gtagcacctt ttctgctgcc atgacaacca 

ctggccctga tgtgccccag ctgggaaaca 

50 

gattgcaaga gaagttcttc aagaggaaac 

tctgggagaa gaccaatagt gttgtaaact 

55 ctgtcccgct agcctcataa tacatcaacc 

gccctttctt tgaaaaaata tataatattc 

ttaaaatata ttgacttcag atctattcta 

60 

taacttcaat atagaagaac ggcaaaagaa 
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agatgagata 


agaataagaa 


cattgacttt 


17520 


cttaactact 


tcaatcacat 


ttacattatt 


17580 


cttttaaaaa 


tacttaactt 


cttattttat 


17640 


gcttttcata 


attttgaatg 


tatacattac 


17700 


tattttcatt 


ttaacttttc 


ccctctagct 


17760 


tttaatcaat 


tttattgcta 


gatagctaat 


17820 


atacacgatg 


acaattattt 


ccacagaaat 


17880 


actacaataa 


cttttttttt 


catttatgtt 


17940 


gacagttttt 


agcatatatg 


ctccactgaa 


18000 


cttcacttcc 


ctaaaatttt 


atggaagtgg 


18060 


acataggtaa 


aaatttccat 


gtggctttag 


18120 


tatactgagg 


agagacagag 


gcataaaagt 


18180 


ggctttatgt 


attcttgcta 


tgcaccagca 


18240 


agagatttaa 


atttgtggtg 


ccaagaatat 


18300 


ttcagagctt 


atgggacaaa 


atacaaggtt 


18360 


agctgtgctt 


tggttataaa 


aggaggaact 


18420 


gatgaatttg 


tgtagcttat 


tactattata 


18480 


atcaagagac 


taggcattaa 


aagggataga 


18540 


gttgaagaaa 


tttctctaat 


ttacctcact 


18600 


tattgatatc 


atataacatt 


taagtctctc 


18660 


ttgcctcttc 


cacttcccta 


gctctaacaa 


18720 


aggttttgtc 


cttagggagg 


aatctgcccc 


18780 


aggacctcat 


cctaaattac 


atttcttatt 


18840 


tgcaaggaat 


ggaacagacc 


actccagggc 


18900 


tagatgtcat 


acattcatat 


ctgtgtaaag 


18960 


ccaaagtcct 


tggggtaagt 


caattgcaaa 


19020 


cacgtttgat 


cagagctaat 


gcccaataca 


19080 


agccctagta 


acatcagaat 


ttaatxtcag 


19140 


tatcacatcc 


tgcttccaaa 


ttaattatat 


19200 


gggttgtggg 


tcagccatga 


taattatatt 


19260 


aaggcccacc 


tgtccactag 


ctgactcccc 


19320 
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ctccacaggc cttcctgaca gctgatagat 

tttcttagcc tactgctcag aggatgaaaa 

5 

agtctcaggc gaacgtgtct aaaaacaata 

gtgaaagaac attaaatata ggtttatcac 

10 tcactgtgga ttctttggat tccttctctt 

atatagttac ttctcataaa atatgttatt 

aatgtatgga taattatttt aaaatatact 

15 

aacaattgag agcaaagtac aggtatatcc 

cttttcaatt atcaacttcc cccaccagag 

20 gttgacacat cattactcaa agtacacaaa 

gttctatggg ctttaacaaa ggcatagtga 

atcccaccaa tccctggcaa tcactaatct 

25 



35 



aaagtcatac 


agttaacatt 


ataaattagg 


ataacatgca 


tttaaatttc 


ctctatgttt 


gtactgaata 


atatttcatt 


gtctagaggt 


aaggacatct 


tgatggcttc 


caaactttgg 


gtgtgcaggt 


tttgtgtgga 


cataagtttt 


ttgtattgta 


agagtatgtt 


tagttttgta 


ttgcaatttt 


gaagatttca 


gtcaacaatg 


ctagcatttg 


atttcctcaa 


tgttctggat 


atctcattac 


cattttaatt 


ttcatttccc 



tatgtttatt taccatctcc atatcttctt 

45 

cattttttta attgggatgt tagtattctt 
tgaataacaa tcctttgtca ggtatgtttt 
50 atcttttctt ctgacagtgt cttctgcaga 
atgattaatt cggtatttca tgggttgtgc 
aaaaccaagg tcatctagat tttggcctat 

55 

ttcaacattt gggtctgtga tttattttga 
gtctaagttc aatttttgcc tatggaggtc 
60 aagactattc ttctaattgt attatcttag 



gtatattgct 


actctttctt 


tcttgcagct 


19380 


atctgctatt 


gtctgagcat 


ttccttctgg 


19440 


tagaattaaa 


acagtgacag 


gttagggaag 


19500 


tagcccaagc 


ttcaaaccac 


tgagaaaatg 


19560 


tttttgcatt 


attcattttt 


ttggagagaa 


19620 


tatgttggca 


catactaggt 


tcaattttta 


19680 


gtatgtttta 


gagcagtttt 


aggcttaaag 


19740 


catatgcccc 


ttgccccgac 


acatgcatag 


19800 


tgggacattt 


gttccagttg 


atgaacctat 


19860 


tcacaatggg 


gttcattctc 


tgtgttgtgt 


19920 


tatgaatcta 


cctttatagt 


atcatacagt 


19980 


tctatcgtct 


tcatagtttg 


ccttttccag 


20040 


tagtttttaa 


agactgcctt 


ccttcaccta 


20100 


tcttatggct 


taatagctca 


tttattttta 


20160 


accacagttt 


atttatgcat 


tcacctacag 


20220 


caattaggag 


taacgttgct 


ataaatatct 


20280 


caactcattc 


gggtaagtat 


gattactata 


20340 


agaaactgcc 


aaactgtctt 


ccctgttact 


20400 


aaggagagtt 


cctgctgttc 


tacataattg 


20460 


tttttaccat 


tctaataggt 


gtgtagtcca 


20520 


tggtgacata 


agatgtgggg 


catcttttta 


20580 


tcttgaggtg 


tcttattaaa 


gtttttgaca 


20640 


aatgttgagt 


ttgaatttcg 


ttgtatattt 


20700 


ttgcaaatac 


cttctgtcag 


tctgtggttt 


20760 


gcagaatttt 


ttaattttaa 


taaaattaag 


20820 


ctttggtatt 


gtattgacaa 


actcatcacc 


20880 


gtcatctgtt 


aagagtttca 


gagttttgag 


20940 


gttaattttt 


gtgaaggata 


caaggtatgt 


21000 


cagtttttta 


agtagtattt 


tttttttgga 


21060 


ttctttatca 


aagatcactt 


gactatactt 


21120 
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ctgtgtctat ttctaggctt tctattctat 
aatatcacac agtcctgatt actatcactt 
5 ctcctaaaac tccttcttct tcaagattgt 
tataaacttt agagtcagtt tatcaatata 
attttgagat ggagtctggc gctgtcaccc 

10 

cactgcaagc tctgcctccc gggttcacac 
gggactacag gcgcctgcca ctatgcctgg 

15 gtttcaccat gttagccagg atggtctcaa 
cctcccaaag tgctgggatt acaggggtga 
tgagggagat tgcattaaat ctacagatta 
tgagtcttcc tatccatgga catggaatat 
ttttttaagt ttctcttata gattttgtac 

25 ccaaatacca atacctaaat atttaatttt 
tttaatttca aatttcactt atttactata 
attaagcttg tatcctacaa ccttaatata 

30 

gattctttca gatattctac acagacaatc 
tttttcctta ccaattgtat acctttattt 
35 cttccagtaa atggtaaaag gagtagtgac 
gtgagaaagc tttgaatttc tccattaagt 
attctttatc aacctaaaca agttatcctc 

40 

aaacaggcat tggattttgc tcaatatttt 
tttcttcctt ttagcctgtt gatgtcatta 
45 cagccttgca tatctgggat aaatcccata 
tattagattt agtttgcaca tgttttgttg 
gttggtctgc agttttcttt tcttgtaatg 

50 

tgctagctcc atagaatgag ttagggagta 
tggtataatt tcattttcaa gtgtttggta 
55 tctgttttga aaattaatta attatttatt 
gagtgcctat ttcttcttgt gtgaattttg 
ctcaaggtta tcaaattgtg ggcacagaat 

60 

attactatgg gaactgtagt gatattcccc 
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tcccttgatc 


tatttgtcta 


tttctttgcc 


21180 


tatagtttgt 


cctaaagatg 


ggtagtgtta 


21240 


ttttgttatt 


ctaagtcttt 


tgcctttccc 


21300 


cacaaataat 


ttgctaatat 


atatatatat 


21360 


aggctggagt 


gcagtggccc 


aatctcagct 


21420 


cattctcctg 


cctcagcctc 


tggagtagct 


21480 


ctaacttttt 


gtatttttag 


tagagatggg 


21540 


tctcctgacc 


tcgtgatccg 


cccgcttcgg 


21600 


gccaccacgc 


ccggccattt 


gctaatattt 


21660 


aattgggaag 


aactgatgtc 


ttgacaatgt 


21720 


ctctccattt 


atttagttat 


tcgacttatt 


21780 


atagtttgtt 


agatttatgc 


ctaacaaaca 


21840 


tagggttgct 


aatttaaata 


gtattgcatt 


21900 


agtacataag 


aaagtaattg 


actcttgtgt 


21960 


atcatttgtt 


aattccagga 


gtttttggtt 


22020 


acgtcatctg 


tgaataaaag 


cagttttatt 


22080 


ccttttcttg 


tcttaccatt 


tcagctatga 


22140 


aggcttatcc 


ctgccttgtt 


cctgatctta 


22200 


atgaatttag 


ctgtagtttt 


attaataaat 


22260 


tattaatagt 


ttttcaaaag 


tttttataat 


22320 


tcctgcatct 


gttgatataa 


tcatgtgatt 


22380 


gttacattca 


ttgatttttg 


aatgtctagt 


22440 


tggttatggt 


atataattct 


tttgatacac 


22500 


ggaatttttg 


catctttgtt 


caccagagat 


22560 


tttttgactt 


ttttttttgg 


tagataatag 


22620 


tttcctcttt 


ttctgttttc 


tgtaaagaat 


22680 


gaattccaaa 


cctttccagg 


cctggtgctt 


22740 




^nf'^nff'if'^ 

adCdd u uoLa 






acaaattgtc 


tttaaggagc 


aattcattgg 


22860 


tgttcgtaga 


gttattttat 


tatcctttta 


22920 


ctttcactta 


tgagtaatct 


gtgtcctctc 


22980 
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tctctttctc ttagactggc taaaggctta 
taaaaaaaag cattggattt cattgagttt 

5 

atttctgctc taatttctat tacttttttt 
ttctttttcc agtttcttaa ggtagactct 
10 atatattcat tcaatgttat aaattttttc 
tttgataagt tgttttaatt ttttgaccca 
gctcctattt taagaacgct gtttactcat 

15 

aatatcagat ttgctcttct ttttttcttc 
cttccccttg ctacaataat tgctttctaa 
20 caaaagagag aaacagttgc ggtagaagta 
gctggttctg gcatactcct tttaaccgtg 
gataatattg gtaagataat attcatggtt 

25 

tatcaaggtc atatgactat atcaaatgaa 
gggagtggca taagcaccat ttttgtggtg 
30 ggttgtgcgg attctgattg ccttgatgag 
tgcatttagt tcttatgaag aacatcccat 
gtcagtgatg gtaagcaggg tattttttga 

35 

aataccctga tctttgcaat tttattctaa 
tggtatctaa cctttttaaa ctcaattttc 
40 acttaaagct aggcgattaa atactttata 
attgccaaag acccaaactg gaatgactta 
agattagggg cttatcattt ccagaattgt 

45 

gactgctagc agactatacc agaggggaga 
tttatataca tatgcccaaa tcaactatca 
50 tagactgatt aagatttacc tctgtagctg 
aaaaactgaa gaaaatgatg gtttagttag 
acagtttcta atatagagtc tactgactat 

55 

ttaaattgtt atttgataca ttttgtttca 
ttaaattaaa attagaattg gaatttaagt 
60 gagagttttg atgcaaattg aaagtaaaat 
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ttgattttat 


ttatgtattt 


ttcttttttt 


23040 


gtctattaat 


tttctgtttt 


taattttatg 


23100 


tcttccgctt 


acttggagtt 


taatttgctt 


23160 


caggtgattg 


attttgatca 


ttattttcta 


23220 


taagcactgc 


tttcactaca 


tcccataaat 


23280 


tttataatgc 


catgagataa 


ttgtactcca 


23340 


gttcatgtcc 


tcaggcccag 


acgtcaagaa 


23400 


ccttgtatac 


actccttttt 


cttcaggtta 


23460 


ttccttggcc 


acttttctct 


gagtgtgtga 


23520 


gcatcctagg 


tatcagaaaa 


gcacctctgg 


23580 


aaatgattga 


cagtggtaga 


gatgctgaga 


23640 


ctttatgtga 


aaaaaagagt 


gacaacagaa 


23700 


aaagaataag 


gatagcttag 


tgattggaaa 


23760 


gataatcagt 


ttttgctttc 


atcatttcta 


23820 


ccttagcatg 


ggaataataa 


tgatgtgtgt 


23880 


ttttgtgtat 


gttgcgtaca 


caatttgggg 


23940 


tataattgga 


tataacataa 


tagaggtgta 


24000 


ttctatcttg 


taattctgta 


aatttgagaa 


24060 


tcatatttaa 


aatagtaata 


ataatataaa 


24120 


taagtcagtt 


aggatatatt 


acagcaacac 


24180 


agcaatgaaa 


gcactgattg 


tttcacaaca 


24240 


ttaatttggt 


ggtgtaagga 


tttcatcaag 


24300 


tggtagacac 


atctcatttt 


tgataaatat 


24360 


gtaagggtaa 


tggacttgcc 


attactggct 


24420 


ggaggagtct 


acactacctt 


ttagctcagg 


24480 


caaggaaaaa 


gagatgtggc 


tggtaaacag 


24540 


caaatctcag 


gaaattgtta 


ccttagaaaa 


24600 


taagtttaat 


aacagtgact 


atgattaata 


24660 


ttaaaaacct 


tttcccaaga 


gaaaacacaa 


24720 


tgagaataaa 


ataccaagtc 


ttacctattg 


24780 



-160- 



WO 02/062946 

acaaaagaaa aataaaattt atgaggcggg 
cattataggg tttccattta tttccaaaac 
5 agagcttaat ttttgctccc cttcactcgt 
gtgtattagt cagggttctc tagagggaca 
aaaggaagtt tattagggag tattgagtca 

10 

ccatctgtaa gctgaggggc aatgaagcca 
tctgatgttt gagggtagga aacatgcagc 
15 ccagtctagt gtttctatgt tcttctgcct 
tagattgtgc tctcccagat tgagggtggg 
ttaatctccc ttggcaaccc cttcatagtc 

20 

atccaatcaa gttaacattc aatattaacc 
ccacacacat ctcctgaagt catacatgat 
25 caaaattatg cctaacataa tgcagctatt 
tatacaaatg ctattaccta aagttaacaa 
cttatgtcac atggtaaaga aaaaagaaag 

30 

aatgtataca tgcacaaaca tgttcttaac 
cggtcttcat ttctgcaact gatcacatgg 
35 taccaccgat tctgtattcc ctttgccttc 
ttttcctggt ggagtgacca aaaccttcat 
tcctgcccag attgggttgt tgtagtttcc 

40 

ctaagagaca ccctaaggga tctcctgtat 
agtagtagac taattttatc ttgatagtcc 
45 cccttcttag ctggttgact tagaggtaag 
aacttccagc ttaatgaaat cattgttgtc 
taagacctct aggccagcag aacatagtgt 

50 

taggtcacta ggggtgatgg taagtggtgc 
cccacaaatc ttggctatag cagaaacagt 
55. cacagccttc tggagaacat tgccccagcc 
taattgtgac ttcaaaagga cattccacca 
ggaaacatgg taagaccagt gaattccatg 

60 

caacaagtga gtgccttggt cagaggcaat 
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tttatgttag aatcaaggaa tctattctag 24840 

tgagaatgag acagtttaga gttttataat 24900 

gtacagcgga agtgaaagat ttgaaggaga 24 960 

taactaatag gatagatgtg tatatatata 25020 

cacaatcaca aggagaggtc ccacaatagg 25080 

gtttgagtcc caaagctgaa gaacttgaaa 25140 

acgagagaaa gatgtaggcc agagactaaa 25200 

acttttattc tggccacact ggcagctgag 25260 

tctgcctttc ccagtccact gactcaaatg 25320 

acatccagga atgatatttt gcatccttca 25380 

atcaagagtc caccccttgt caacctgaac 25440 

tttcaaataa agaaaataaa gacaaaaggc 25500 

ttttgtacaa cccaaaatgc accaatcccc 25560 

cacttaaatg ctaatatgaa gttaataagt 25620 

aaaataaaat gaagatattt tcttagtaca 25680 

aaaagaagga ggaaatactc atgacaatta 2 5740 

ttgtagctgg tattgatgac agccttattc 25800 

aggaagcacc tcagcaggtt gtgttttttt 25860 

tccagaaggc ctagtctggg ccatttgtag 2 5920 

cattgacctt aatcataggg catggtaata 25980 

tccacacata cttccttacc tccattgtgg 2 6040 

aggtcagtca ccctagccaa cactgtaact 26100 

aggagcccaa aatggccagg tggcaatctt 26160 

tcccggtggc aacgttcctt cctctggaac 2 6220 

tgtggaaaca ggaagcaaaa attttgctgg 2 6280 

cacttccact tatacccctt gatttctgga 26340 

accttatatt agacgctgat tcagagcatg 26400 

ctgaaaagta ttgtcacctg gttgatgttg 26460 

ttctatcaat ccagctgctt caggatgatg 26520 

agaatgagcc cactgccgca tttctttagc 26580 

gctatgtggg ataccatgac agtggttaag 26640 



-161- 



WO 02/062946 

gcattccatg agttcttgga ttgtagtctt 
cccatatcca aagtaagtgt ctattctggt 

5 

aagaggtcta atataatcaa cttgccacca 
gctatattga gggctcagtg ctggtctttg 
10 catagccagg tcagtgaatg gaagtccatg 
gccaccatgg ccacttcgtt catggactca 
aggctgactg gtgtccatag aatgagtcat 

15 

tgctgaggtc accgtttgat aagcactcac 
ccactcggag aggtccattc acatacctct 
20 atcatgcttc ttccaagtcc ctgactactc 
agtatataat tgcagatctg gccatttctc 
tgctcaacat tctgcccact gggacgattt 

25 

aggaaggggc tgtagtgctg cagctgacca 
gccatctgta aacagggccc ttgtcttctc 
30 atgagaccat cagtgcaggc tgggggagag 
gcatttgagc cacttcctca tataacttac 
cacatatatg ccacttccat ttgatgatgg 

35 

tagacaagtc agaaagcact caattcatga 
gacccatagc caaccgttta gtttctacca 
40 aaaaaggaaa gtagttatct gcagaagatg 
tctgctgtga ttcatctatg ggggacttcc 
ctgacacctc aagcacaatt agatctgctc 

45 

gcatagcagt ttggacctgt tgcagagcct 
aagcctttca ggtctcttga ttaatgggcc 
50 cctccaaaac ccaaataagc ccactaggca 
gggcaaatac agcaacttat ccttcacctt 
ctgaacccct agaaatttta ctgaggtaga 

55 

ccatcctctg gcatgcaaat gtcttaccaa 
actggatccc atcagtataa tggcatcaat 
60 ggaaaagtga tcaagatctc cctgaacaag 
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gccagaagca 


ttgtgtgcag 


gataggcaaa 


26700 


gaggaaaaac 


cgctgccctt 


tccgtgatgg 


26760 


agtagctggc 


tgatcattcc gaggaatggt 


26820 


ctgctggcaa 


attaggtgct cagcagtggc 


26880 


ttgctgagcc 


catgcatatc 


ctccatccct 


26940 


ttgggcaatg 


acaggggtgg 


ctggagaaag 


27000 


cctgtttact 


tgattattaa 


aatcctcctc 


27060 


atgaaataca 


aatatcttca 


cagtttttga 


27120 


tcccaaaatt 


tatttgtcac 


caattttcct 


27180 


agccaaacca 


ttggctatag 


cccatgaatc 


27240 


cttccaagca 


aagtgcacaa 


tgaggtgcag 


27300 


tccttcaccg 


ctgtccttca 


gggatgtcct 


27360 


ctttgaggtg 


gtgcctgcat 


atcgtgcaga 


27420 


ttcctgttaa 


ttgatcctgg ggaactccct 


27480 


aaggcagtgt 


ggcgggaatg gggatcatga 


27540 


ttgtgccctc 


aggagttgct 


caagcctgat 


27600 


aatgctgctg 


tgcatgcccc 


actttatggc 


27660 


taggtagttc 


aggtcgcatg gtgacttgat 


27720 


aagtccagta 


acagggcaag 


agctctttct 


27780 


gcagggcctt 


cctccaaaat 


cctagaggcc 


27840 


aaaggctcct 


aacagcatcc 


ctctctgctg 


27900 


tgttctatgg 


cccaagtgac 


agagcaactt 


27960 


tctcctgtcc 


tggaccccac 


tcaaaactgg 


28020 


aagtaacacc 


cccaggtaac 


gaatgtgtgg 


28080 


ttgtgagtct 


ttcttggttg 


tagggggtgg 


28140 


agaaggaaca 


tcttgacagg 


ccccacacca 


28200 


aggtccctga 


attttagtca 


tatttatttc 


28260 


taagtccagt 


gtgcttgcta 


cttcttgctc 


28320 


ataatggacc 


agtgtgatat 


tttgtggaag 


28380 


gtcatgacac 


aaagccagag gagagttgat 


28440 



-162- 



WO 02/062946 

atacccctga ggtaggacag taaaggtaca 
cttctggtgg gccttataga caggaatgga 
5 ttaccaggta ccagaagatg ggttaatttg 
aactgcaact ggagtcacta cttggttaag 
tccatctgtc ttctgcactg gccaaatggg 

10 

cacccttgca tcttttaagt ccttgatggt 
gggatattgt tgttgattta ctgcttttct 
15 ggccttttcc accatagtag ccttcactct 
cagctgctaa gcatgtctgt gtcaattatg 
tgagtgtggg gacccactgg agccactgta 

20 

acatgacctc cataagcccc tactataact 
ggaatcaatg tcagctcaga gtcagtgtcc 
25 ctttctttcc ccaatccagt tacctggtaa 
gagaaagatt aacaacataa attgtcagta 
ggcctccttt tcattcaagg ggttctgggt 

30 

agtggggctg cgattctctg tttttataat 
aagttttctg cttatgtaaa ttaagtaaga 
35 taggaacgct gtaattaatt agccaatacc 
tgctgctttg cctcttctgt ctattatggt 
gtgctgccat ttggcccctg ctgtctctag 

40 

gtagttgagt gactacagtt tctactgtaa 
agctcttcaa ggatgcaagt gctgccctct 
45 gtatatcttc tgaaccctcc cagctgggat 
tcatcacaat ctgcctaagc ctttggatcc 
cattttcagc tcacccacag tgggccatct 

50 

taaactagaa cttttttaac tctgcaagct 
gggcccaaat caataaattc agcctgatcc 
55 cccttaatat tcattcccat gcctgttttc 
caagcagttc ttttcaagtg tagcacactt 
taggggcctg ctgggacttt agttataggt 

60 

tcctaaggag aatcaacatt gtcttgcttg 
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ttgctggtct 


tgccagctga 


aggtaaattg 


28500 


gaaaaaggca 


tttaccaaat 


caatggctgc 


28560 


ctcaagcaat 


gaaaccacat 


ctggtacagc 


28620 


ctcatgataa 


tccactgtca 


ttctccaaga 


28680 


agagttgaat 


ggggatgtgg 


tgggaatcat 


28740 


ggcactaatg 


tctgcaatcc 


cttcagggat 


28800 


aggtagaggc 


agctctaatg 


gcttccattt 


28860 


atgagtcagg 


ggccagtgtg 


ggggtcctga 


28920 


cattctggca 


ctggggaaat 


gaatgcagga 


28980 


agttgaacct 


gagctaaagc 


tccattaatt 


29040 


ggaggatcac 


aatgactttt 


tgggtcccct 


29100 


agtagttccc 


aaaaggtctg 


atcatttttt 


29160 


aaggccagag 


gtctccttga 


ggaaggatgg 


29220 


gtatagtggg 


atccttcttt 


aagaggacct 


29280 


ctgtaaaatg 


gcatatgtct 


ggaaattgat 


29340 


tcaaattagt 


gttttgtcca 


ctcgacctgg 


29400 


attcggtaag 


cttcctgtca 


atttcacttc 


29460 


agagctctac 


acaagtcaga 


ctattctgat 


29520 


agctatgcct 


acctttcctt 


tgatggctga 


29580 


atccaattat 


ttttattgea 


tttaaatttt 


29640 


gatctgacat 


acagagaaag 


gcaatcacag 


29700 


caaatctatt 


tcacaaagta 


ctggtcaagg 


29760 


gaataggctt 


aaagtgacta 


atccactcct 


29820 


cttcctctac 


attaaaccaa 


ggagatcagg 


29880 


tttaatccat 


atttcagcta 


accaagcaaa 


29940 


gcaacattaa 


atgeagaate 


ectgettaga 


30000 


aactctatgt 


tcttccacca 


ttatcccaca 


30060 


caaattt eta 


cttatataaa 


ttaaaaaact 


30120 


tcttgtgtat 


cgcattctga 


acctcacctc 


30180 


ctagaagcaa 


acaggggect 


tgggagtggg 


30240 


gcaacacctg 


aggagaaggc 


catcactatc 


30300 
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ggggaacctg ccccgatagt cacgtaggtt 
ttgagaaata aagggacaga gtacacaaaa 

5 

aagacatcac atgtcgatag gttccgtgac 
ttattaggga gtttcaaaaa gggagggagt 
10 gtactgtaca aggtaataga atatcacaag 
cacagtaccg gggtgaaatt aaaattgcta 
acatcttatc aggagacagg gttttgagag 

15 

ggaatttcct cttcctaata agcctgggag 
ctacagcctc gaccataaga gaccggcaca 
20 caggcgcgta ttctctttcc cagggatgtt 
tctcccattt gcttttgaaa gaagagaaat 
tcagaattta aggttctctc tcttattccc 

25 

caaggtgccc agatttcata ttgtttaaac 
caattattac agggtcctga ggcgacttac 
30 aaagtaaaga cgggcatagg aaatcacaag 
tgaaatcttt acaatttatg tttagagatt 
agtattaatt tggggaactc ataaatgtcc 

35 

catggcttca gccggtccct ccgtttgggg 
cattaggcag cgcagggttt atctccttag 
40 gttagggagg ggatgttgcc actactggtg 
aggctcctga gagtttacaa gctcagtgtc 
cccattccaa gttgcagggt cccattcttt 

45 

cacctggtga gtctatgcat gcaccttttg 
ttgtgtctga gtgtccacaa tttcagctct 
50 ggcaatctta gaagatttga ggctcagtat 
gagttcataa ttttctttca tcactttgtc 
aactccatcc ttaatattct gtttttctag 

55 

tcagggttct ctagaggaat agaactaata 
ctaataggat agatgtatac ataaagggca 
60 tagtctttcc atgtttttct gcctcctttt 



cttttctgtt ttccctaagc gttggctggc 303 60 

gagagaaatt ttaaagctgg gcatccaggg 30420 

accccacaag ctgcaaaaac cagtaagttt 304 80 

atgcgaatag gtgtgggtca cagacatcaa 30540 

gcaagtggag gcagggcgag atcacaggac 30600 

atgaagtttg aggcaccatg gtcattgata 30660 

caactggtct gaccaaaaat ttattaggcg 30720 

cgctatggga gactggggtc tatttcaccc 3 0780 

tctggggggc catttatagg cctatacccc 30840 

ccttgctgag aaaaagaatt cagcaatctt 30900 

atggctctgt tctgcccggc tcaccggtgg 30960 

tgaataattg ctgttatcct gttctttttt 31020 

acacatgctg tacaatttgt gcagttaatg 31080 

atcttcctca gctgacagga ttaagagatg 31140 

ggtattgatt gaggaagtga taagtgtcca 31200 

gcagtaaagg caggcataag aaattataaa 31260 

atgaaatctt cacaatccac attcttctgc 31320 

tccctgactt cccacaacac atcactgttg 31380 

acaaaggtga aaaggctgat ggcagtgtgg 31440 

atggggaagc tgttttttct ggcaaaaaaa 31500 

cccagcttca tcaaggttct accgcacatc 31560 

ttcaatcaat gctctcgctt taacagtaga 31620 

ttgcaggtca gcaaatcgca tgataaaagc 31680 

ttctctacag aagataagac tctcactcag 31740 

ctgcttctga agcttagagt tagaatccct 31800 

cagtgaactt aggagcaaaa ccaaaaaaag 31860 
atcatctcct ggtaccaaaa tctgtattag 31920 
ggatagtcag ggttctctag agggatagaa 31980 
gtttattaag accagaagac taaatgagtc 32040 
attctggcta tgctggcagc tgattgactg 32100 
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aggatgggtc tccctttcct agtccaccaa 
tcacagacac acccaggaac aatactttgc 
5 tattaaccat aacagacagc aaacactgtt 
tacatatgat atataaccta tatatataat 
ttattaaact cttactatga tctggacatt 

10 

tttcacatca tcaccacaaa cttttgaggg 
gaagagaggt taagtaactt gctcaagatc 
15 atagggtcac tctattatca gaatcacata 
tccattttct tttcaagatc acattgaaaa 
tgttcagtag attaccaaga actgcaggag 

20 

ctcttgtagg cattgaacta gattggtaaa 
gctaccttcc attattcttt caaccttttt 
25 tttctccctt ttacagtata agtaccagta 
ttggcctctt cagttctatt cctatttatg 
agataccata tacatatttt ttcctgtggt 

30 

caaatacaca aacatacata cacacttcta 
ttttccaagg tgctttactg aattttttaa 
35 ttggatgttt gcaatatatg ttattcatac 
caactatttt tcttatataa ttattttggt 
tctacatgtt tatcttgatt ttgactttaa 

40 

aaacatgata aagacttgta gactgagaca 
tatttatatt ttaccaataa tgtttaagtc 



60 



tatcctatca 


acactgtaaa 


catgagtagc 


tttattatat 


atattagaat 


atttcaatat 


ccaaatgttc 


atgatgtatt 


ttatgttaca 


tttccaatgg 


tgatttagat 


aacatgaatc 


gaagtttatt 


gagctatttt 


attagtccag 


catctgagat 


caaatttgta 


aaggaaataa 


gagagtccaa 


ccaacttgat 


tagctatttt 


ggattcagat 


gaagtagtat 


attctagtta 


tactatttta 


ttttataact 


ttttaaacat 
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ctcaaatgtt 


aatctccttt 


gtcaacaccc 


32160 


atccttcaat 


tcaatcaagt 


tgacactcaa 


32220 


ccatctctgg 


aaagttgcaa 


tatatatata 


32280 


atatatatta 


ccaatgtcaa 


taatagcttt 


32340 


ctgctaagtg 


ctttttatgt 


gtacttttta 


32400 


aggatttctc 


attatcttca 


ttttatggat 


32460 


acaactgttt 


ggtacaggat 


cagggttcaa 


32520 


catgtaacta 


cctaggtttc 


cgctttcatg 


32580 


ccagcttaca 


tggttttagc 


cttccacacg 


32640 


ctgtgtttgt 


ccactgagtc 


atcactcaga 


32700 


tgcagtaaat 


tggtaaaacc 


cttttttgct 


32760 


ttccgaacat 


cacctgcaaa 


gtgcctctct 


32820 


ccaagagcca 


gctgtaagtt 


tgtattatcc 


32880 


tttcacattc 


tttgttaagt 


ttttcctctg 


32940 


aaatgaaatc 


ccttttttca 


atcctatctt 


33000 


atatgtttat 


ctaatttctg 


tatataacat 


33060 


agagtttatg 


gtacttttta 


gtttatattc 


33120 


ttaaataaca 


gtagttttag 


ccatttgttt 


33180 


taatataccc 


caaatagtaa 


tacataatag 


33240 


taagatggtc 


ctactttttt 


tttaatcatt 


33300 


gatgtcctat 


cttgttacag 


aaagtataat 


33360 


actaataatg 


ttggattgta 


ttaaatgtcc 


33420 


aattaacagg 


taacgttttt 


cttaaattta 


33480 


taaaaatctt 


tgtagagttg 


aaggaacatc 


33540 


tgatggatga 


tttgctctac 


tgttattttt 


33600 


atctctatgt 


ctgggtattt 


ggaaactaaa 


33660 


tttctttata 


ttaggatata 


atttttgaat 


33720 


aaaactctaa 


taatggaata 


ttataaaata 


33780 


ggctaaatga 


aaaataattg 


ccccattgat 


33840 


tcaaataact 


gggaactaaa 


ttttatgtgc 


33900 


attttcaatg 


agaattattt 


tagtacattt 


33960 
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tttgtttatt gtattcaata ttaataccat 
agctttatct ttcagtgtcc tcaaacagtt 

5 

aaggcttgat aaattatccg gtggactcat 
aagagaaaat tatgtgcaat ctcttgcttt 
10 atctaatttg gaataaagtt tactaagttg 
tctccatttt tttctctcat tcttatttta 
gaattagaac aacaaaaggt ctggtgagta 

15 

attaatcaat aatgtattct gccagggtat 
aagtcaggag caactatgca ttttacttag 
20 taatgataga attactgaca taataatgga 
tttatgaaga tatggaaaat tatttagaac 
ctgcagcttg ggattcagcc catgacagtc 

25 

ttgaccaatt gattaaaagg tagattctgg 
aagactttgg gaggcctagg tgagtgggtc 
30 gccaacatgg tgaaacccct tctctactaa 
gggcgcctgt aatcccagct actcaggagg 
ggcggaggtt gcagtgagct gagattgtgc 

35 

actccatctc caaaagggaa aaaaaaaagt 
aattttcaat ataagaaatt cacaataaaa 
40 gtaactctca tgtcaagatg cctttcataa 
acaagggttt ttctggctaa acactatttg 
actctggagc atgatcaata tcggttttcc 

45 

tgccgggacc tgcagccaca ctacactctt 
aaaaagcctc cttaccttac ccatgagaag 
50 aaaaataatt taaaatgtag agaggcatga 
agatcgggca ccagaagcaa ggaatggcag 
tgtgtaaaaa atcattgaaa accctgataa 

55 

ggaattatct gaagaattta agctggagaa 
gatcagagtt gaaaccacta taaggagaca 
60 tagacagcag tgtaaagtaa ttaacagaga 
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ataagaatta 


cttcttatca 


ggaatttgga 


34020 


ttaatatcat 


tccagttagc 


tattatttga 


34080 


ctggcagaat 


ttttctgttc 


tttctgggaa 


34140 


ctaagataat 


tgacctgttt 


atacatatct 


34200 


tgatttccaa 


tacttgttta 


aaacttttta 


34260 


ttatttcagt 


atccttatca 


gttgcagcag 


34320 


atgttttgtg 


gggggtatca 


aaaggaaagg 


34380 


aaaacaggtt 


ttgtttgttt 


gtttttaaaa 


34440 


caggatctgt 


ctcaaaaatt 


acgataaggc 


34500 


tattttgtca 


aggtagttga 


tgagggaatc 


34560 


aaatatgaag 


tgggatcact 


caaaaaagct 


34620 


acaccaagta 


ttggtgtggc 


ataatgttta 


34680 


ccgggcgcgg 


tggctcatgc 


ctgtaatcct 


34740 


atttgaagtc 


aggagtttgc 


aaccagcctg 


34800 


aaatacaaaa 


aaattaccag 


gcgtggtgac 


34860 


ctgaggcagg 


agaattgctt 


gaaccctgga 


34920 


cactgcactt 


cagcctggtg 


acagagcaag 


34980 


agattccaac 


tgtatccaaa 


aaggattttg 


35040 


ttattacata 


agaagcagca 


aatttaaaat 


35100 


agccaataaa 


atgcaaaagc 


caagatctca 


35160 


agttttcttt 


gttcaaaaga 


taaatatcaa 


35220 


aaatctggcc 


taacccaact 


tcctagtgtc 


35280 


cacactttcc 


tgtgtaattt 


taatacttat 


35340 


agaaagtaca 


aaaaggaata 


cacaaccgag 


35400 


gaggcaagca 


gtgtttagaa 


caatgataag 


35460 


ggaaggacac 


agacatgact 


gtccagctaa 


35520 


atgatttgga 


gattatctca 


tcttctcaca 


35580 


taatgggatc 


atatttgtgt 


tttggcgaga 


35640 


agttagaaga 


ctattgaaat 


aagataaagg 


35700 


tatatctgag 


gatgatgttg 


gattgcttaa 


35760 
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ttaattttca tttctttcta ggttggaggt 
ttggctatat cagggatctt aatcaatgca 
5 cattactgta accacgatca gttgtcaagt 
gtgagtgttt tctcttttca tgagatatta 
ctggcaacaa caataattat catttcctaa 

10 

ttgagaaata gaataatctt tcatttagga 
ttttcctcta tatgtgatga ggcaaaatgg 
15 tccttaatag ttactgggct gggcattttg 
tgatgcagtg tctactgtta tctcctccag 
gtcccagccg gacacggtgg ctcacgcctg 

20 

gcggatcatg aggtcaggag atcgagacca 
actaaaaata caaaaaatta gccaggcatg 
25 ggaggctgag gcaggagaat ggcgtgaacc 
agtgccactg cagtctggcc tgggcgaaag 
aaagatgtca gtgcctattc tcaaataata 

30 

agtggtaact cagtatcctt tcccacatct 
atcctcaaca tgtggcacct gaaaaattct 
35 ttgtaccagt tgtccttatt tcaatgctat 
tacacaacat tttgtaagta ataaaagttc 
gggaagagtt cttgcagcac atcacagcag 

40 

attattatta ttattgttgg agatggagtt 
gcagttcaca atgcacgtgg tggcacaatc 
45 tcaagtgatt ctcctgcctc agcatcccaa 

gcctggctaa tttttgtatt tttagtggag 
cttgaactcc tgacctcagg tgatccaccc 

50 

ggtgtgagcc actgtgccca gccaaatctt 
acaccaccaa actttcagtt acttttcctc 
55 cttagttctt ttcctcactg tgcccttccc 
ctaacttact agacagaatc aactgcattc 
agtcttgagg gctacgtagt cacaccaaaa 

60 

tttacttccc ttgagatatc taatgctttt 
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agtctaggaa 


agaatatcac 


cagttcagtc 


35820 


ataagcttga 


cgttttattc 


attccgttac 


35880 


aattgttaca 


tgactatgtc 


cattttaatg 


35940 


tcatagaagc 


agtttagggg 


agtgctggct 


36000 


ttgctgaatg 


actttggaag 


ttgtaagtga 


36060 


gtcaggtaca 


catctcatct 


ctgtctttat 


36120 


gaagaaagct 


tatagttata 


gaacacctac 


36180 


cataaatttt 


tctgtttaat 


tttaaaacat 


36240 


acccctctaa 


aaaatggcgt 


caaaaagtca 


36300 


taatcccagc 


actttgggag 


gccgaggcag 


36360 


tcctggctaa 


catggtgaaa 


ccccatctct 


36420 


gtggtgggca 


cctgtagttc 


cagctactcg 


36480 


caggaggtgg 


agcttgcagt 


gagccgagat 


36540 


agtgagactc 


tgtctcaaaa 


aaaaaaaaaa 


36600 


aagtaaaatt 


gtcttgaaca 


gtattttata 


36660 


ttggatgttc 


ttgcaacgtt 


aaccactttt 


36720 


caatctccat 


tctgaatttt 


actacttttt 


36780 


ctcaattgct 


ttattctgaa 


aattaccact 


36840 


ttgatgtttt 


gggccattgg 


gtaagtgtgg 


36900 


actaaaatgc 


acaatttaat 


ctttataatt 


36960 


tcactcttgt 


tgcccaggct 


ggagtgcagt 


37020 


tctgctcatt 


gcaaactcca 


cctcctgggt 


37080 


gtagctggaa 


ttacaggcat 


gtgccaccac 


37140 


acggggtttc 


tccatgttgg 


ccagcctggt 


37200 


acctcggcct 


cctaaactgc 


tgggattaca 


37260 


tattattaaa 


aagcctactg 


gtcattttca 


37320 


tctatgaact 


ttcagccttt 


cgccttgctg 


37380 


t~ r*a t~n t"nt" a t~ 
L- a L.y L.y v.uL 






37440 


tgtttacttc 


atttctagct 


aatcccacat 


37500 


taaccgcttg 


aggtgtcttt 


gcgaaactaa 


37560 


tatagtaagt 


aattttattt 


cttgccactc 


37620 
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actcaagata agaaaagaaa gtttcttttg 
agttagaaat tggcaaaagc aagattcaat 

5 

tatttgctat tgctatcccg aagaagaacc 
catagacttt aatggcacaa tgatagagac 
10 tgatttcgtg ggcatgggac ctgtgcagtt 
tgcttggtct aatgctctgc tgttgctatc 
acactctgta aattacgtag ctggttctgc 

15 

tgatgaattc ctattaactg tgtgaacatt 
cataagagct ttctggaatt tgaagcaatg 
20 cagtccacag gtctatgagc ccatagtgta 
acataagaaa gagaacagtt ccctaccaca 
atagaaatgc tggaaagcat ataattatta 

25 

aaagtagtta gagtctacca cccggaatca 
tcaactatac ttaacctaga aagacagcag 
30 taattttaat aagagagtct ataaattaca 
ttcccagagt ccatgatgaa ttcctgttaa 
agcaggggac ctccataagg gcttcctggg 

35 

gcacctgcac aatcagtcca caggtctata 
gtatttgttt ttgtaaataa agttttattg 
40 gtgtattgct gcttttgtgt tataataaca 
gtctgcaaaa cctaaaatgt acattatctg 
ctacacttct agatggtaaa atccttgaag 

45 

ctatattctg tgcctacaac aatgtttgaa 
caagaaaata aaaaagcaat gaatgcaaat 
50 tcatatcctg agctcttggt ctattggctc 
ggagagaaaa agagagaata tttaaaccag 
aatagaaata agtgtactgc ccaattgttc 

55 

gactagccct aaagccagtg ctctggggag 
attgttgtat tagcttctta ttgttactgt 
60 catataaaat catttagtgt ctcaagacaa 
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agaatttgac 


aaattttccc 


aatgtcactc 


37680 


tgtagaactt 


cttgtctcta 


atttccctct 


37740 


ttacttttag 


tttatgaggc 


tctcttcatt 


37800 


aaaatctgga 


attggccagg 


ttcatagggc 


37860 


actcaggacc 


tcagagttag 


aagagccctg 


37920 


ttgaaattat 


ttttaaaaat 


tttaaacaag 


37980 


attcatctca 


atcacctttc 


ccagaggtca 


38040 


tttatgcctg 


ttctaatagc 


agaggacttc 


38100 


agtctcttct 


tgcctgggca 


cctttacatt 


38160 


tgccccatca 


cagggttggc 


accccggtag 


38220 


aaggaacttt 


ggttctcctg 


gaagaaaaag 


38280 


aagaggccag aaaaagttga gctctagaaa 


38340 


ttaggcagaa 


ccatatgaaa 


ttgccaatga 


38400 


tttcgtatag 


ctcaagctaa 


gatatatatg 


38460 


tagctggtcc 


tgcattcatt 


tcagtcacct 


38520 


ttgtgtaaac 


atttttatgc 


ctgtttttat 


38580 


atttgaagca 


acgggtctat 


tcttgcctgg 


38640 


tacatgctat gtttttggaa gtccagccct 


38700 


aaatgtaacg 


aacccattta 


tttacatatt 


38760 


ggtttgaatg 


tgtgtgccag 


agaccttacg 


38820 


gcctttacgg 


aaagcttgcc 


atcttctagt 


38880 


acaggggaat 


tgtctgtttt 


gtttgctgtt 


38940 


acacagaggc 


cgctatagct 


gttgaaggca 


39000 


attctcaagg 


tctccttaag 


tctgcctcac 


39060 


ttccgcaaca 


gagggactat 


ccctaaccat 


39120 


ataatcgctg 


catcagtccg 


gggtaaaatt 


39180 


tccaaatttt 


gactatgttt 


cttggctgaa 


39240 


atggattaaa gactgaccta cttgatgttc 


39300 


aacaaattac 


tacaaatcat 


taggcagaac 


39360 


tacagtttgg 


ttatcttctg tttctggagg 


39420 



-168- 



WO 02/062946 

tcagaagttc aaagtgggtc tcactgggct 
tctgaaggct taggcaacaa taagttttac 
5 ctttcttggt ttgtcatccc atcctctacc 
ctctcttgaa ctctattctg cctcttgctt 
acccacctga ataatccagg aaaattctct 

10 

gctgcaaatt taatcctttt tgttgtgcaa 
cattcaaatc cccagaagct gaggattagg 
15 attactctgc ttacaatcac gaagatatgg 
tctagtactc aataggatat gatgacaatt 
gcactttcat cttccatgcc cacaatagca 

20 

ctgtgattat tttattagtg aaaaaagtga 
aagggtatgg ctgccattgg gaggcaggtt 
25 atgacattgt ggttgctggg aaatgttttc 
gttatgagtt ttctcctagg gtacggatgg 
ctgcattgct gtgtccctct ctgcctttgg 

30 

ggtcagtatt ggccttcatt tgaaggtatc 
ttaaacaaac ttatagtctt aacacagtac 
35 aagagtctgg gcccagtgga tgtgggtcct 
ggagttggcc ggggctgtgg ccttgtctaa 
cggaattaat ttcctttaag ttgaggactc 

40 

aggagagcca gagtctctag tgctttaatc 
ggtccatcca gaataatgag tttttttata 
45 cgggacttgt atcccatctt atacacaggt 
aggggtcagt gagtgtcacc ttagaactct 
tttccccttc ctttctcaaa gggcataacc 

50 

tgaagaggtc cccaaaagta cttggtttta 
ggaatacgga aggaatgtca tgatctcaga 
55 ctctcttacc ctgagacctg cagcagatgc 
cagagagagg aaagtgagga atggaaggag 
aatggggtgg gatatcctcc tagaattgag 

60 

ccccaacagc tactcattag cttcattacc 
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aaaatcagtg tgggcaaggc cgtactcctt 39480 

tgccttttct ggcttctgta ggctgcctac 39540 

ttcaaatcaa gcaatgcagc atcttcaaaa 3 9600 

ccacttataa ggattctgtc attactttga 39660 

cattccagct aataagcaac cttgatttca 39720 

aaagaacaat ggtcccccca gggaggtctg 39780 

ggactaggat gcagacctcc ctgggggacc 3984 0 

aatcatagta ttttctcaat gatatgagtt 3 9900 

atctttatac taccttccta acattttaaa 39960 

ctgggaggtg ctgaactagg gaataaatgt 40020 

gaagctactt cagcagatca agaattggtc 40080 

tcctgactcc cagccagtgc tttttctctt 40140 

acttgtcatc tccatttctc acattccctt 40200 

catggtgctc ctcttaagtg tgctggaatt 40260 

atgtaaagtg ctttgttgta gccccagtga 40320 

tgtattactt ttgtattgct gtataatcag 40380 

acattcatta tatttcagtt tccatggatc 40440 

tggttcagga tcttatgact ctgtcatcag 40500 

agcttctgat tctctttcat gtgaatgtgg 40560 

atgacaatgt actttttctt tgagaccagc 4 0620 

ctctcataaa gcattcacat gaagagatga 40680 

aactgaaagt caacaaatca gtcctacata 40740 

ttcatccaca cttgagagaa gggaatcaca 40800 

gcctagcaca agagctaaat aaaagctctt 40860 

tctgaatcca gataagaaaa tatttcacaa 40920 

ttctattatc ctattacaac aacttttcct 40980 

cttctctatc agtctgtgcc cacctgccca 41040 

tcctggcagt taagccaaaa gaaagttaca 41100 

agaagaaatc aagaaagagg ctctaagaat 41160 

ggttccagaa tgaggaaatg gatagtaaaa 41220 

caaagtctga aaggcagaca aaaccgtcag 41280 
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ccaaagaata aagtaattgt agtttgagga 
tttgtgcctc tctgtcttct ttagggattc 

5 

accctccctc atagaagaga gattgactat 
attgtaaaac cctagaggtg agagattctc 
10 aagtgaagta actttcatgt catgatattt 
gaagctgggg aatgttgttt ccctaatccc 
ggggcatatc ttggctaaat ctctatctct 

15 

aatgttgcac tgagtttgtg gaagaataac 
tgctattcca ggccagtgtg gtttttgcaa 
20 tgggtatgtt gatgtcatca gcatgaacaa 
catgatacct tggtttaccc atcatattgt 
cttgcgagac cagggcagtg gtcatcagca 

25 

tgtcctaaat ctagcctgat ctaaagaatg 
aatgaatcag agaagaaaag agtaaactct 
30 gggcactctg aggtgatttg ggtgactcac 
caccatcaaa ttctcacatg gcagaaatag 
gccaccaaaa gattaacaga agaatgctcc 

35 

ccacatgaga aagtaccaga atccaactcc 
atatggaatc caattatgac ctctgtgtgt 
40 atatatatat acatatatat atatgtgtgt 
tttttcccct ggaactcaac aactaatttc 
aaattaataa ataatgcatt taatgaggca 

45 

aggactttat ataaaggcat taaactggca 
tattgctaaa acgaggtctc catgcaaaca 
50 ctcacttact tgactttttt tctgccatat 
agtggagatt catgcccttc tcccccctcc 
agtaccttgg aatagtatag tcagtccttg 

55 

tatacatgga gtaaaaatca tatgaaggat 
tcttcctctc ccttccccca ccttctgctg 
60 cctgtgtcac atatggaaat gtgcaacata 



gattgtttct 


ctggtgccct 


tctccatcat 


41340 


ccaggcaata 


ccatatccca 


ctccccgaca 


41400 


agcgcattaa 


ctctagtggt 


ggggttttgc 


41460 


agcaattacc 


aatgattttg 


taatttcatc 


41520 


aagtctgtgt 


tccctttggc 


tctttgcata 


41580 


ctggtttgag 


tccagaaaag 


cagtggcttt 


41640 


tcctggtccc 


ccaccgtgcc 


ctaccaatcg 


41700 


cagaactttt 


cttagttttg 


tgatgatacc 


41760 


tgctttattt 


gtaaccagca 


aatcatgatt 


41820 


cctggtttgg 


aagaagagcc 


ccatattttt 


41880 


gcactgtcat 


ccacatgtaa 


gattgatcct 


41940 


gctcaggtca 


agagctatca 


ctgaatgttg 


42000 


actgagaatc 


cccacatgac 


tgatattgag 


42060 


gataatgatg 


actttatgtg 


ggcaggaaag 


42120 


ctatggcatt 


ctgattgttt 


gtgctaatta 


42180 


catctcccac 


accacttaag 


acagtttgat 


42240 


agaaatctat 


gctgactgta 


acacaagaac 


42300 


aatactgata 


gacatattga 


tatcattatt 


42360 


gtgtgtgtgt 


gtatatatat 


atatacatat 


42420 


gtgtgtatat 


attcaaaatt 


ttgttctcat 


42480 


attggccctt 


tatcgagagt 


actagaagtt 


42540 


gcagcacttg 


aaaggttttc 


attcatcatt 


42600 


aataagattt 


ggaagcagaa 


gggcaaaagg 


42660 


catacttctg 


ctcccctgta 


taacattcct 


42720 


ttggggacca 


aagtgctttt 


tccttcatga 


42780 


ttttccttct 


gctttccttc 


acccatagaa 


42840 


catgtgcaca 


agctatcatt 


tcagtaaagg 


42900 


cagattcaac 


ttatattttc 


tattttttct 


42960 


ggcagaatta 


tatcttaatc 


aaatgtgtat 


43020 


tggtatttgt 


taatgtttgt 


taattacatt 


43080 
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tgctttttta ttgcagagca aaaataaaat 
ttctattcag atgcaatatg atggaaagag 
5 gtggtaaata atgtggactt tctaatcaga 
ctttgttgtt tactacctat gtagctatga 
tttctcttca gcaaaatagt aataatattg 

10 

gtataataaa gatagagcaa ttagtactct 
aatacatata catatgtaca tatatacata 
15 tcagcatata tttgtaaagt atctacaatg 
atagcggtga ataagagaga taatatcctt 
gacacagatt aacataaaac aatcaaaaca 

20 

gaattaatgt gctggagagt ggttgttggt 
taaccaggaa agttcttttt gaggaagcga 
25 gaatttgcat ggacattcat attttccaac 
cactcacgta tctattgtat ccacctttct 
cctccttctt tcaatgctaa cctgtctact 

30 

gaaaaagtct attatgacat tctatcttct 
aaccaccaaa tagaacctta atactgttgc 
35 actattgcat agatgttcta tgtctcttat 
tttattactg gcatacgtgt gtgtctgtaa 
gcagaaatag caataaatgg aaatgaaatg 

40 

tgtcacaggg ttcatatagt agcatgatgg 
agcttcaagc ccaacaggaa agaggaaaga 
45 aaaatgatga gagtaatttg attggaaatg 
aggtcacaaa gctaccacag aacttcactc 
gagcagaact gagaagcgcc aagtgaagga 

50 

aagtcaaagt ttacctcaag ttgggccaat 
gaagatgatt aactgctata actgggcaaa 
55 ataaaagcca gaactcatac taaatcctag 
gtcttggatt ttgtaaaata tcctaactta 
aagtggcctg agaatgcctt ctgttggttt 

60 

taacacttac ataagcaacc taagattagt 
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tagaagcaat actttcatgt gctacctcct 43140 

actaaatgtt gatgtcgaca gtactgtatg 43200 

caaccctggg ttttcatctt gctatcccat 43260 

acatgccaca aaatttatct gtgcttcata 43320 

tctggcccat agacctatga taatttcttg 43380 

ggtacatatg gagcctcaaa agctttaatt 43440 

ttatttatat ttgtcagttc atcattttat 43500 

gggcagacat tgttttaagc atataagaaa 43560 

cctcttgtag gttaaacttt tatgggacag 43 620 

tttttatatg ctgttagaag ctaactgtaa 43680 

agtggttgtg aaatatgagt cattagagag 43740 

tattttcatt atgataagat aaaaagtatt 43800 

tcagtatata ggatttctca ttctgcccta 43860 

ccatgcacgc ttttcatggg aatgtgaggc 43920 

ttctccagct ccggagatgg acagtatctg 43980 

tggctgagga ttgaaaggta atctagcaag 44040 

ttactattat gtagtatata gaaccttaat 44100 

gggacatcag tgagaataca tgtagcctac 44160 

aggaagtatg tttaaaagaa agtcaaataa 44220 

aatgcttgat ttcatggatg agcatctgtc 44280 

ctgctgattg cagcaaactt acctcttcac 44340 

gagggagatc atttcttaga cagtgaagtg 44400 

tgtcagtcat tcacacaatt aatttttcct 44460 

tagttgagtt gatgtaattt gctcacagac 44520 

tcagacccta gcaaaatcct ttgcatctcc 44580 

ccctgacgtt ttaaatatat aaaggcaagg 44640 

atccttcaat ggcttcctct ctaactcata 44700 

aagtccttac atctcctgat tctttgttct 44760 

tatcctgata tattgccttg agattatttt 44820 

aaattagcgc atacataaac tttttaaggt 44880 

acatttttag ttgtaaaaag ctaatgcaaa 44940 
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tattttaggc aattcatgca aaatattatg 
tatttcatat catgcaattt taaatttatt 

5 

aagtgattag gatgtaaagc agtaggaaaa 
catttacaag aaaaatagtt attcttttta 
10 ggctctgtca cccaggctgg attgcagggg 
ccttggctta agccatcctc gcacctcagc 
ccaccacacc tggcttattt ttgtattttt 

15 

ggatggtgtt gaactcatga gctcaggcga 
gagtacaggt ttgagccact gcacctggcc 
20 attccccaca acttactagc tgtgggtagc 
tataaaaatt acaatagtac ctaccttata 
aacttgtaat cacagaatga agattgacac 

25 

atactgatag gtgacatata aagaattatt 
aataaaataa taaaataagc taaagattaa 
30 tattatactt ttaaatattc aggaaaagaa 
tttaaataac tgtactaagt ataagagtaa 
accctgaaaa tcggtaattt tcattaactt 

35 

tattactgtt tcaaatttaa taccagaata 
tccctcaaac ctattttcca gtcattgtga 
40 cagttttcca ctgtcaatag taatacgcaa 
agagaggagt atttaacttc tacagatgtc 
ctggagtcag gtggaaagga tgatcctcct 

45 

taaatcaatt ttgtgtttag gaaaaactca 
tttggaaact aggactgttt tcggtttggg 
50 tttttaggac agaagaaaca tagtagggaa 
agtgtaacac aggtgcatgc attcagataa 
caagctccca tacattctat atacaaaaac 

55 

gtcattaatc ttcattgtaa acacattcat 
atagcaacaa tttgccaaca gcacacacca 
60 tcctcaccta caccaattcc tttcctttag 



atttcaactt gtagagaagg ctgcctcatt 45000 

ctttatttgg ttattctagg ttctatctat 45060 

cgtccatgaa aactttgcct tcttttgcta 45120 

tttatttatc taatttttga gacagggtct 45180 

tatgatcttg gctcaatgca acctccacct 45240 

ctcccaagta gctgggctta caggcataca 45300 

tgtagagaca aggtttcgcc atgttgctca 45360 

tccacccact ttggcctccc aaagtgctgg 45420 

aagataatta ttctttacat acatgattaa 45480 

ttatcattgt gtttcttcag tttcctcatc 45540 

ggctcttaca aagactacag gtattattaa 45600 

tggtaggcga tatgaaaaga atggtgagtg 45660 

aaacaatata aaagcaaata acaaagtaaa 45720 

cataagtctt aatttgagca cataagcagt 45780 

acagataagt gtgtccacag aattctgatt 4584 0 

tgtccatatg ggacacaaag aagctgcact 45900 

cagtgaagtt ttagaagtgg gaatttataa 45960 

atctaagatt ctctaaaaat ggagtcatct 4602 0 

gttaggtaaa atgagattcc aactgtagta 46080 

ttgggataaa gtagtggatc tttttctcta 4614 0 

atattgagaa tgacatggca gaacaggcac 46200 

acaagtaatc agctgggcat gcatatgata 46260 

gtgtgggtca cttagtggct ttgatgtata 46320 

taatagctct cagttggcta tctgagagga 46380 

caagaaaaag actataatct aagccaatgt 46440 

ttttagaatg ctactgacca gattgaatgg 46500 

atcagaaagt tatttaaaaa tgtgtactct 46560 

agcaccaaag gataggtttg ttgtggaaat 46620 

taaatcttct tcaaatcttc aagagatagg 46680 

atcatctcaa cttgtattaa taagacgtag 46740 
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aggcaatttg attctctgtc tgcttagtga 
taggccaaga attgcctttg tattggctgg 
5 caattagtga cagtggttaa ccatcaattt 
cataaggtct tgagtatata aggtgatatt 
cctattcagg tagagtcagt atacttggga 

10 

ttatcgagtg ctcaatggtg cccaggctgg 
acctccacct cccagccgcc tgccttggcc 
15 cggccgccac cccgtctggg aagtgaggag 
cgtgaggagc ccctctgcct ggctgcccag 
tgccatccca tctaggaagt ggggagcgcc 

20 

gaggagtgtc tccgcccggc cgcccatctt 
gccctgtctg ggatgtgagg agcacctctg 
25 agcgtctctg cccagccgcc ccatctgaga 
ccgtctgaga agtgaggagc ccctccgccc 
gtctccgcca ggcagccacc ccgtccggga 

30 

gctgccccat ccgggaggga ggtggggggg 
ccgggaggtg aggggcgcct ctgcccagcc 
35 cccggccagc cgctccgtcc gggagggagg 
ccccgtccgg gagggaggtg gggggttcag 
agggaggtgg gggggtcagc cctctgcccg 

40 

gggttcagcc ccccacccgg ccagccgccc 
cccgcccggc cagccgcccc gtccgggagg 
45 tgggaagtga ggagcccctc tgcccggcca 
tcattgagaa cgggccatga ggacaatggc 
ggggaaaaga ttgagaaatc ggatggttgc 

50 

agacttttca ttttgttctg tactaagaaa 
gtgaccttac ccccaaccct gtgctctctg 
55 tggattaagg gcggtgcaag gtgtgctttg 
ttaagagtca tcaccactcc ctaatctcaa 
cgcagggtcc tctgcctagg aaaaccagag 

60 

tccctccact attgtcctat gaccctgcca 
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ttgcaagtgg ctgctctgct gggttgaagt 


46800 


tatgtacatg tctgagagca agggctctgg 


46860 


gctctgtgaa 


taataaggat 


aaaaacaagt 


46920 


actgataact 


tctaactata 


agtagtccaa 


46980 


ctaaaaatga 


tttatttttt 


aaaaaagtat 


47040 


agtgcagtgg 


cgtgatctcg 


gctccctaca 


47100 


tcccaaagtg 


ccgagagtgc 


agcctctgcc 


47160 


cgtctctgcc tggccgccta tcgtctggga 


47220 


tctggaaagt 


gaggagcgtc 


tctgcccggc 


47280 


tcttcccggc 


caccatccca 


tctgggaggt 


47340 


ctgagatgtg 


gggagcgcct 


ttgccccacc 


47400 


cccggccgcg 


accccgtctg 


ggaggtgagg 


47460 


agggaggaga 


ccctccgcct 


ggcaaccgcc 


47520 


ggcagccacc 


ccgtctggga 


agtgaggagc 


47580 


gggaggtggg ggtcagcccc 


cgccaggcca 


47640 


gtcagccccc 


cgcccggcca gccgccccgt 


47700 


gcccctactg ggaagtgagg 


agcccctcta 


47760 


tgggggggtc 


agaccccgcc 


cggccagccg 


47820 


ccccccaccc 


ggccagccac 


cacgtctggg 


47880 


gccagccgcc 


ccggccggga gggaggtggg 


47940 


cgtccgggag ggaggtggag gggtcagccc 


48000 


tgaggggtgc 


ctctgcccag 


ccgcccctac 


48060 


ccaccccatc 


tgggaagtgt 


acccaatagc 


48120 


ggttttgtgg 


aatagaaagg 


ggggaaaggt 


48180 


cttgtctgtg 


tggaaagaag 


tagacatggg 


48240 


aattcttctg 


ccttgggatc 


ctgttgatct 


48300 


aaacatgtgc 


tgtgtccact 


cagggttaaa 


48360 


ttaaacagat 


gcttgaaggc 


agcatgctcg 


4 8420 


gtacccaggg 


acacaaacac 


ctcggaaggc 


48480 


acctttgttc 


acttgtttat 


ctgctgacct 


48540 


aatccccctc 


tgtgagaaac 


acccaagaat 


48600 



-173- 



WO 02/062946 



PCTYUS01/48437 



ggtcaataaa aaattaaata aataaataaa 
ttttttcctt agacttactg aagcaaaatg 

5 

catatatttc tgtccccttt atcctctaaa 
caaatagtat aagaaacgta atgatactaa 
10 tcagtcaatg ataaagaaat gggtaatcaa 
gaattgctgc aaaaactgct gtctattgag 
tatatttata aaatcaaatt aaatccactt 

15 

aatcattgtt acaacagtta taagaggtat 
ctgaatacta tatgcaaagt caggagcaaa 
20 gtcattctta cattacacag taaattggat 
catctcgggt gttcacaatg accattaaaa 
gttttcaaaa tttgccattt tccacaactc 

25 

gtaaacttgc aatactctgt ggaaaggatt 
gaacatgaat taaaccagaa aaaaaggaaa 
30 aataattgat gcattttatc ttaatacttg 
tcttgttatt aatttttatc attaaaatgt 
ttatttttct tcattattaa aattgatttg 

35 

ttagaattag catgtcaaca tatccaatcc 
ggtattgaat ttataggtct ataagggaag 
40 aatccattaa cagaacatac tttttaattg 
ggttttttca ttttctgtta acatgaggaa 
ctggtactaa attttctcag atctttttgt 

45 

taagttttga ttaatatttt caaaggatgt 
gtactctaat aatgtcttat gactgtcttc 
50 caagtatcct tatctttgtg ataatgagtc 
tttcttactc atttttagca acctaattaa 
catccagctt gtggctagtt aggctgtttg 

55 

ggaaattcca atgatttata caaatatgtt 
ttacatgtta gactacttca tactgtccca 
60 ttcctactct ttgcttcaga ttatagagat 



taaataaata aaaagtattt atccagtatt 48660 

aagacttccc actaagctaa atttgtgtat 48720 

atctaatttc tcttcactca tcctatatat 48780 

aagattaaag aatattcaga cgtgattttt 4884 0 

taatgagcaa tttgtaagtg tactataata 48900 

aaaacatgca tccaactatt aaaggtaatt 48960 

gtaaatgtaa gagaagcctc aaagggcttc 49020 

ttctatattt gaggttgtaa aataataata 49080 

tcaacagttc taacagtcta acagtgatgt 4914 0 

catagagcat gaaacagatt caggaagaat 49200 

atctgtagtt gaacttagtt taaattaact 49260 

ttatttttct tgattctgag tcctgatcct 4932 0 

tcaagagaga tatactttta tctttaataa 49380 

tgagttgaag gccactgctg tggctactga 4 944 0 

actggaaatt tcctgtttct ttggtttgta 49500 

ataatagctt tatagtagca catatctatg 49560 

gccattctaa gtctttattt catgtaaatt 49620 

taaaagacct tcctgaattt tgattaggag 49680 

aattgctagc ttaacaatat taagtattct 49740 

atttaggcct tcattaatat gtctcagcaa 49800 

cg'tcctttga cattcattgt attgattttt 49860 

ctgttgtctg aatacatctt tattttacct 49920 

ggatcccatg ttgacagctt ttgccttcaa 49980 

tggcttacat aatgtcttat gagaagtctg 50040 

cttttccccc tctgctgtta aaacattctc 50100 

gttatgcttt ggtgtagttt gatttttctt 50160 

gatctgtggg tttatacttt tatcaaattt 50220 

ttaaattcat gctctcttct tggactcaga 50280 

ctgatgcttt attaatgttt ccagactttt 50340 

tctatttcca ccttccagtt cactaacatt 50400 
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ttcttctgga aaacttaatg tgttgttact 
tgtagctttt atctctagaa gttctatttg 
5 aagtatttct tgaaattctt atgcacattt 
attgtaatct ctataattta caggtctttt 
tgtgattttt ctgtttcttg ccatgtatag 

10 

tttttatttg attgagtacc caaatttttc 
tgcaggaaat taaacttaaa gaccaatttg 
15 cgggagggtg tattagtccc ttttcacatt 
ttataaagaa aagaggttta attgactcac 
ggggaggcct caggaaactt acaatcatgg 

20 

atgcagtgga agcaggagga ggagagagtg 
ccagatctca tgagaactca ctattatgtg 
25 ccgatcacct cccaccaggt ccctccccca 
tgagtgggga cacagagcca aacagtatga 
atgctttact aataatttgt ttattattaa 

30 

tgacttatta ttaataacct ttctggaggt 
gtctctgcat tccagctgat tggaacttga 
35 atcatacagc ttacaactac ctggctagtc 
aaaggtacag ctttgtattg gacaacaaat 
tctctggaga gctatcccct ttattgttct 

40 

ctcaaattct ggtttttgtc ttatgaattc 
catggtatgg aaagtttctt catgcagaaa 
45 tttcacttgt caatgttgaa attcctgcac 
gcttattttt ttccagtttt ctttatgatt 
ggttcaagtg cagtggtaca gtcataggtc 

50 

atcctcccac ttcagcctcc caaagtgctg 
ccagtttttt aactgtacag agtggaagat 
55 tgagcttcaa ctttaaaaag atgctcatga 
atatttaaaa tttacttaag cttgtcaaca 
tttaaaatat ccaagcagaa aatttaagta 

60 

caccttattt aagtttcatg aagtagctat 
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cataggcagt gatttattca tttcatatac 50460 

attttttata tttttatact ctttgtgtcc 50520 

ataatgaata ttctaaagca cttctctttt 50580 

ttattaattt ttctcctggc tatttgttat 50640 

taatttggga ttagatgctg gatattatga 50700 

ttttttcaca agatattcca ttttatttct 50760 

attcttttga gacttttaca aaggcatgat 50820 

gctataaaga actacctgac actgggtagt 50880 

agtttcacag gatctacaag aggcatggct 50940 

cagaaggtga aggggaagcc agcatgtctt 51000 

aagggggaag tgctacacac tttcaaacaa 51060 

aacagcaaaa ggagagtcca ctcccatgat 51120 

tcattaggga ttacaattca acatgagact 51180 

gaaggtttat cattgttatt aattgggggt 51240 

cagattaatt attaaagcat taataataat 51300 

ctctattcta tcccccaagt attcatggaa 51360 

atgtctccca gcattgtgtg agccctggta 51420 

cttgactgta cttttaaaat ttcaacctat 51480 

ttaggatact tgcagatttc tggaactctg 51540 

cttctcatta attccagctg tttcagtgcc 51600 

agcaatatca gaataaatct ctctccctac 51660 

accagagcaa ttattatatt cactttgttt 51720 

taactttgtc caatgtccgg aaagagttgt 51780 

taaaaaattt tattgagaca agatcatcca 51840 

actgcagcct tgaaatcctg ggctcaaagg 51900 

ggattagaag catgagccac tgggcccaac 51960 

taagtctggt tctatcataa ttggaagagc 52020 

atgtaaatat tagcattttt atttttcttt 52080 

tatgtaactt aattaaatca atacttctaa 52140 

ttatcatttt tatatgctgt tatttttagt 52200 

taaaagaccc tagcacattc cagttatttt 52260 
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atatttaagt taatttgtga tattttccaa 
aatatatgct caaatgtctt ttcttcacac 

5 

aagttgtaca cattccaggg aaaatattta 
ggtttaaatt ttgagtttac tgtatgaaaa 
10 tttgattgcc tggtggatag aaaaacaaaa 
aagctgacaa tgcttttgcc ttcctctttg 
tactgactca ggtttcttgg gattgctgct 

15 

tgtgaataac tctgtcaggc tgataatact 
tacactcttt ctagcagaag aagcagttca 
20 ttttcatcgt caatttcagc aaagcgtggt 
tgttttaatc cttctcagac tctaggtaca 
tcatctctta aatggatgct ttcagcttgg 

25 

agtcgccgtc tggatggggc tcaggaatca 
tgttgattct ttttccctta tgccagcagc 
30 agtaatttat tttttatctt aatttttttc 
atgaaaatat ttccacctta cctccagctc 
gtcaccaaat tgtctaaata acatacatgc 

35 

atatatacac acacacacac acacacacac 
aatatatata tgcattttta agttacttga 
40 tattacacac aaatatgtta tttgaaaata 
aatacattta aatatataaa taattgtata 
atgtttttaa aaacataata tatgcaaagg 

45 

actaatggag aaaggaaaag gagatgctgc 
atccaaagaa taatatctca ctagagattt 
50 agaagagttc ctagtgtatg gggtacgaga 
aattaagaga agcacagatg tccttaggga 
agttggaaag tgggtaacga aaactgagat 

55 

aggaaacaaa agcacaaata attgttttgg 
aaggagagat ccttggaact agaaaggcca 
60 aataagagat attttatgga taatgtgcac 
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tgttttatat agtaatatcc 


agacagtttt 


52320 


aacaaaaaat acaagaaata 


taataagaca 


52380 


aatcttttaa ttaattaaaa 


tatgtccctt 


52440 


acacaggaaa gatgtataaa gcaacatcct 


52500 


aacaactaaa tttgagatac agtttgaaca 


52560 


aatctcttca agatcagcaa 


tttatttttt 


52620 


gctcatgtga ttcccaggaa 


cttaactact 


52680 


agattaagaa aacaattcat 


cttcaataag 


52740 


gaaagttcct tgtggtttcc 


cttctgaatc 


52800 


attttccccc agtaaaaggt 


caagatagct 


52860 


gtttcaccag acacagtggg taacttgctc 


52920 


ccaggagggg ctggacatca agaggaggaa 


52980 


catagtgctg tattagctag 


ctgtcatcta 


53040 


aaagtgcttc aacccttgct 


gattacttcc 


53100 


tacttagtat tccctcatat 


ttggaaaaag 


53160 


tgaagccaca gacactgtgc 


ccctaacagg 


53220 


atatatatat atatatatat 


atatatatat 


53280 


acacacacac acacacacag gcacacatgg 


53340 


aaatatattt aaatacataa 


ataatttgtg 


53400 


catttaatta tataaataat 


tgtatattta 


53460 


tataaatatt tgtatattac 


atataaaggt 


53520 


gagagataag tgaaggagat 


agaggaatat 


53580 


ttactagctt ctagcttttg 


ttttccagca 


53640 


tatatgtgct tcagaaagat 


gtatagaaag 


53700 


ggtatcatgg aatggaaggg 


ataacatgat 


53760 


atagaagatt ttctggtcct 


gaaggagagg 


53820 


gcagatggga cctgggaaac 


agggctagct 


53880 


gtccaaagag ttaaggcctc 


cttattctct 


53940 


gagagacaga cttgctcatg gacctgcagg 


54000 


agggagttgt acgtatactt 


agaggaactc 


54060 
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caaaatttct ctccaacccc ttgttgtcaa 
agaaacttgg gataactaga tgacctctca 
5 cttgagtgtc atcagcaaga cggtggaata 
gagactgact tgacaacaat gtatgaacta 
ttaagaggtt gcagcacctt aggtgaatgc 

10 

agaaattgtt gtcgcatttt actcattcca 
tgatctacag aaaacattaa agttccagtg 
15 gtaagcaaaa actcagtttt atgtaaaaaa 
gggctggagt cctttaaaat taactgccta 
tctgtctttc ccaggcatgg tgaagcattg 

20 

agacagataa tctcaagtca taaaacatgt 
gcatgtctta attaaataac tgtctttgtt 
25 cagatctccc cccaccccac aaaatgctta 
attcctttgg atgttaatct gactggcttg 
caacctctct gtctctctga ttacttaatt 

30 

ggcaagagga ctagaccatg agtccagcat 
tggtttctgt cttgcctgac tcagagtgca 
35 aacaaaggca ggcacactat ttcaccacta 
gggggagaaa gaagaagaag caggcaaaac 
ccaaagaagt ttgaagggcc cccagaatct 

40 

ctacataaag tcagtttgta atgactggaa 
ctcaataaaa gatcacaagg catacagaga 
45 acaataaatc tccaaaaact aactttaaag 
taaagatatt tgtacacata tttattgcag 
aacccaaatg tacattaatg gatgaataaa 

50 

ttaatcaacc ttacagaaga aggaaatcct 
agatctcatg ctaagtaaaa gaagccatca 
55 acaaatactc tgtgattcct ctcatatgag 
ctgtagacag aacccctcag acaccaagtt 
cttcagcaag actcacgtct ccaaaaacca 

60 

ttaagggctt acaactttaa gtgggtccat 
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acataagcat 


ctgtttttga 


gctctgacac 


54120 


gcacaatagg 


ctctagcaat 


aggaaattga 


54180 


ggaagcctca 


ggccctcatt 


tccccatgga 


54240 


caatgccttt 


gtgagaactc 


tagaaactag 


54300 


aaagccagga 


aaagatgcat 


caaaatgagg 


54360 


gccactcttt 


caccctggca 


caatgcgact 


54420 


gaacaggaat 


taaaagaaat 


taaaaactgt 


54480 


aaacccccaa 


tttcccttaa 


agaagagaaa 


54540 


tttttctttc 


tgaggctagt 


gagccttatc 


54600 


tttttctagc 


tgtgcagctg 


caagatcact 


54660 


tgtttcttga 


aaaggaagaa 


atgatgtaat 


54720 


tctcacttct 


gtaatatgct 


tcctcctgca 


54780 


aaaggtaacc 


acattctttg 


ttcagggctc 


54840 


gtgcacctaa 


aaaattaaat 


aattcctccc 


54900 


atcccactgc 


accagattct 


cctttaaaga 


54960 


tctgggtttt 


gtgggggtta 


cccaaggtac 


55O20 


gagttgaata 


gcatgttgtg 


agcctcggag 


55080 


gggagattgt 


agtactgcag 


gcagacacca 


55140 


tcttttacta 


ggaaattaca 


catacatgtc 


55200 


ctagctatgc 


taacccgtaa 


aggtcttcca 


55260 


aaggttggtg 


ttgttttcaa 


atgcacaaat 


55320 


agcaagaaaa 


catggcccaa 


ctaaaggaat 


55380 


aaataagtat 


ctatgagtta 


catgatcttg 


55440 


cattattaat 


aatagccaag 


aggtggaagc 


55500 


gaaaatgtgg 


tatatacata 


caatgaaaca 


55560 


atcatatgct 


acaacaagaa 


tgagacttgg 


55620 


caaaaaaaaa 


aaagaagcca 


tcagaaaaag 


55680 


gcatccaaag 


tagacacaua 


gcaggacaag 


bb/4 0 


aaagaaggaa 


gggctttatt 


tggccgggag 


55800 


agttcccaga 


gtgagcaatt 


cctgtccctt 


55860 


gtgagagggt 


cgtgatcaat 


tgagcaagca 


55920 
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ggggttatgt gactgggggc tgcatgcacc 
tcacagtgct tttccataca atgtctggga 

5 

ggtcgatctt taaccaggcc cagggtgtgg 
ccgcctttta gtttttactt cttctttctt 
10 aggggtggtc tcctccttta gtcccccgct 
cactttcatt ttcactaccc atgtcttctt 
acatttgtgc tgaagcattt tggtgaacta 

15 

gaagtacagg tagcaaacaa gagagcagta 
ttataagagt tttaaatcct cctagcactg 
20 catatccatg ccacacttgc acgggcacat 
cttcaactac ttgcccttgg tcatctatgt 
tacagacccc tccttcagct gccagcaagt 

25 

catttctcat ctgagtttct tgctgggcca 
tgattatttc taagacagct tgtaaccata 
30 agtatcccca tgagctgtct tttgcccaag 
caggggacca tttattatct ttccaatttc 
catagacagg gaagcccagg agttcacctg 

35 

taatagtgcc aataacacaa ctactttccc 
gcccacatat ccagtataat ccagtggggg 
40 ttcacatggt ttacaacttt gggaatttac 
gccacggtgg ctgtttttgt agtactatta 
cccacaggaa gggtgaagtc cttccccact 

45 

gctttcaaga cccagaagtt atcagggtga 
gggtctgtag gtactaattc tcaggcttcc 
50 ccgcataaat aacatgaagt gacattgaga 
aacaaatttc ttgtttttcc tggaatttct 
tttgaaatac tggctcagga gagggtttat 

55 

aaggatccag tccagcccca tcgatttcta 
gatcaagggg gttggttatt accagttcta 
60 ggccactttt ccttttctga aggtggacag 



agtaatcaga atggaacagg acagggatat 55980 

tccatagata acataaccag ttagctcagg 56040 

tgctgggctg tctgcctgtg gatttcattt 56100 

tggaggcaga aatgggcata agacaatatg 56160 

ttgagaatct cactcattag tgggagttct 56220 

gcaagacaga tcaatagtga ttcatatagt 56280 

aggtagcgat gaagcttttt atcatttgaa 56340 

agcaggtttc tattactatt attactccta 56400 

ggaaccattt tccaaacatg gccccaggat 56460 

gtgccagttt tgtcatattt ctaactatgt 56520 

gtagatagca attagtaagg ttaaatgtcc 56580 

agttgggagc caatctattt tgatagatag 56640 

gaatagtcaa ggctctgctg gtcttattag 56700 

tgctttggtt gagcatgtaa atgggggtcc 56760 

tagcaggcca ataatattgt atgattcttt 56820 

ttatggctat gtttctcttt tcatgggaag 56880 

tctttatggg cagtaggaag aaagatgatt 56940 

actggtcagg taatttggca taagttctat 57000 

ctgtccagtc ctggtggtac tctgggtggg 57060 

taaatggatt tttcttagtg tggtttgaac 57120 

tacagttttt gcccaaggca gctgggtctt 57180 

cttgctatac agtattgtct aatgattgag 57240 

ttctcttgag ccaggaattt atcaggaact 573 00 

catggccatt gatctcccag tatagttcct 573 60 

gactgggcta catgcttggc taattgcaaa 5742 0 

ggcactggca cattcagttc atcatagaag 57480 

aaacttcttc tcaaactatg atatttactc 57540 

gggttacatg ctcccctttt ttccagggag 57600 

aggggttaca cggacccctg gtataggaag 57660 

gatttttttt ttcatttttt atccaagtag 57720 
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cctaaatgac acaagaccag tatccacatt 
tgtacttatt ttttgtcata tagcctcttt 
5 acttattact attaatgaca gcacaggcat 
ctttttcttc tgttttggct aacactttaa 
cagtccttaa tcttatttca aaaactgtgg 

10 

cataacacag gtcatttcct gggttacata 
ttattttcag agttccagta cacttataat 
15 tttgtcctac ctcagtgact tggtgtatac 
cagttgaagt ccttactgta caagttcaaa 
ttcttatgct ttggccatgt gtggaccagt 

20 

gttgtcctct tcagagtcac tttgcagggg 
tcacagtcta ctgatgttca aggatggtct 
25 ccaatacctc tacatattta tgttcagctg 
ggtttagggt gttgcaaact tcaatggtta 
cttggttaat ctagcatttg ttagccaatg 

30 

agcatggaag gcctttatat tcaggttttg 
taacagggct gtcactgcca gggcccctta 
35 ctagttgttt tgagagatag gccactgtcc 
ctccaactgc cattttttct ctttctgaca 
gtagccccag ggctgaggcc aacatgagtt 

40 

gttggttgta atagatgtag ttcatctaat 
atgttgactt taatcttgca gctatttgat 
45 atgggaatcc aattgcatct aaatagacgt 
tcgctttaca atgtcttatt tttcctccct 
tagccaattg gactgaagta caagtgccac 

50 

gtccaccaca ataccaccac acatctgctc 
gctcttgaaa attcttaagt tcactacatc 
55 cttcctattg tgagagacac gaaatgaact 
tcgggggctg aacttcaggg tgccagactt 
ttattactcc aggctgtaga atcctggaaa 

60 

aggaccacct tagtggaaag gggactgtct 
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cattttcaca cagtcctaat tcatgacaaa 57780 

tttaattaag agaaccatat cctatttcta 57840 

caaatttcaa ggtgactttt ttgggcaccc 57900 

tcatgtcatt tatgagcccc cacgagtcct 57960 

tcatgggaag ctcagatggg tcataacaca 5802 0 

ccttgtatag aataacatta tacaaacaag 58080 

aaccataaaa taataggacc atagcaactt 58140 

actgggaaca gtcctcagtc tgaggaaggt 58200 

ttttaaggaa aatgagtccc atgatgagtt 58260 

cagcttctgg atgtgactgg agcagggctt 58320 

ttggtgaaac tgctactgtc cacatacagc 58380 

tggaggttgg gcctgctaga ataaactagt 5844 0 

agctctctga taccgggaac aaggtggcag 58500 

tgggggattt tcacatagca agctttgata 5 8560 

atgtcctttg gtattcatca aagttaccac 58620 

cctaagcgtt agtttatctg cttcttgtgc 58680 

gacatggggg ccagcctttg gaaactttgt 58740 

ttggccaggg cccgacagtc tgggctaaaa 58800 

catagagtgt aaagggtttt gtcaggtcag 58860 

tttcttttaa ctcatgaaaa gttcattgct 58920 

ctacatattt atttacagtt atctactaaa 58980 

ttcaagcttt aaagtgatct ggtattcccc 59040 

gagagtcaaa agacccataa ggggcttctc 59100 

ctggctgatg aaatgccagg gtgaaaggga 59160 

tccagttatt cagcagagtg cccagtaaag 59220 

tgggatgaat aagggctgac tgattgataa 59280 

ccttcaggtt tccaaggaac gctaagtttc 59340 

tagtgttggg agatggcagc tggatggccc 59400 

tgggatatag cagagagagc ttggcatgac 59460 

agagatacca tgcagcccac ggtccactgg 59520 

gggcctctgg cctgccacgt gcacaagcat 59580 
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aacaattgct tttgtttaac gtgcaaatgg 
catcttggta tcttgtctta actgccaaag 

5 

tctagtaaaa tgaatgtatg attttaggaa 
ttgctcttta gtggtcaaca gaacgttgga 
10 gggcaagact cctggttgac actggagtct 
taatttacta atgcccagtc tgaggagagt 
gagagctgtc tttgacttgg caagtcccca 

15 

atagtttgag gcaaaattga cttggttatg 
ggaaagaaga aagagtaata gaatagataa 
20 gtagggtttt cccctgggac tatggcccac 
ggtgtgatga gtccatcctt tttccactgt 
agggtagggt ccttcctagg ctggctcgaa 

25 

aatgtgatct tcagactggt gctggcttac 
aagactttta gttttgaggg aaaggaaagt 
30 aaattgacct tttgttttaa atgtggggat 
ctttctactg agaaatttcc tttagcacct 
caaacagctt tttatatttg acgcttcctg 

35 

cacccttata ttagtgtgtt attaatgtta 
atttatttaa tttttaatgt tagatcataa 
40 tttataatct ttgttaaaga gcaggttaat 
ttaatgttca gttcacagaa aaactggatg 
cacacagaat ttcctttaca attgacgttt 

45 

ttaaaaatct tttaatgtag gtaaaaatcc 
ccagaggtgt attttacttt tcttatacac 
50 tacattcagg aggcctagtt acttttaaat 
ttataactct ttttctttca tgacttttgt 
gacttattac aaacatttct ttctttaaac 

55 

tactatataa gattcttttt acataaattc 
gggataacca ttcttttcca aagcaaactt 
60 gctacaagat tagaagttac tattatacat 



aatatttgat ccattttaac caggcatttg 59640 

tttgttttaa gtctttaact tctatgatcc 59700 

attacaaaaa ccagttgggt cagtccatcc 59760 

ccaactatgg cataaaagtt ctacattggg 59820 

ttaccaaagt ttcccccaat caaatagtcc 59880 

caggagggac agaggtactt ttctgaagta 59940 

cagggtataa taaggcaagc attaaatgca 60000 

ttaataacta gatggtcagc aatagagtga 60060 

aagagttaaa tttttctcag ctttagtttg 60120 

gactctggag cgggtggagc tttcttgact 60180 

atgaacagca gtcttggtgg ttagcagcac 60240 

tttcctcttt ttttttcact ttttgatgag 60300 

cagaaattgt aggggtggca catgtgttaa 60360 

gggagataaa ccaagtatat aatttttaag 6042 0 

gtcagcagtg gactttatag tccttggtgc 60480 

attttttatt agtttttaga ccaaagaagc 60540 

tatgattttt ataccagata agctaaatgt 60600 

aacttagttt taataaaact tagtaggcat 60660 

ggtaagattt ttatagactc tttttaacct 60720 

gctttaagag aaacccattg tgtttttatt 60780 

ataccccttt aactttagcc aatatgttta 60840 

taaaacttgc ttaaaccttc aaaacaattt 60900 

acattcttat gcctccttat aatgctttta 60960 

cttgcacata aactgttttc tcagtagttt 61020 

tatacaacat ttcttgcatc aattcttttt 61080 

agacaattct tcaacatgcc tcaactttct 61140 

aaccagttaa tttatttcag gacaagaatt 61200 

tgcccccgct tttttccccc tttttttggg 61260 

cctttatgtc tgtggattag actgtctaag 61320 

gttatactgt taacttttag caaattttac 61380 
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ttttgttgaa aaccttgtac gtttgggatt 
gtttagtcca aattaactta gaattggtat 
5 gggaggaacc gtctatcatc cagtcctgaa 
tgtatggcaa ttaagattta gatcccctgt 
agtggttaat gttaaatcat cttttttttt 
tgtgctcaca atgaggtttc ctctaaaagt 
agttgccgct acagagtgaa tgcatttgtg 
15 tgataggaag tctacaggtt gtcagtggcc 
acttacagca aggtggtatt ggagtgttgt 
ttataggttt taaatttacc ctgtctttta 

20 

ctacttgtat atctctttct ttctgtcttt 
ttttgcctct gtatctttct ctctctctct 
25 ttgactccct ctttgtctct ctgtctcttc 
ctctgccagc tgcttatgct gctgttctct 
cagctgtacc caagtgtcta tatacgggaa 

30 

cttgtgccac acctttgaaa caagggacct 
ctccaatact ggccagtcta tttcacacaa 
35 actgtaatct cccttaaatc ctttcttgaa 
tttgtgcctg gcccatgctt cctgaagaca 
tgggtaaaaa gggcacacac acacttttgc 

40 

atcagagtgt caggaaatcc aagccgggtc 
agtcaagtca aaatcaaaaa ccaaagtgct 
45 tgcttccact taaatggagt gggcaagttc 
ccggactcca agtgccagtt ccttcatggt 
gcctgctgtg cactgctctg atgaggcatt 

50 

tctcaggatc tgcttcactc aagctggctg 
ggcctaagcc acctaaggag ctgtctcgac 
55 ggaaccaaga aatgtagcag gacaagccac 
aaggaagggc tttattcggc caggagcttt 
ccccaagtga gcaattcctg tcccttttaa 

60 

gagggtcttg atcaattgag caagcagggg 
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tcaattatcc tttgctatta gtaagacttt 61440 

agacggcttt ttcttttcct tccattacct 61500 

gggagttcct cctaggtctg gtcagacctt 61560 

taggaaacct gctgggttaa gggaattttt 61620 

ttcccttagg atacttctga actggtgagg 61680 

tatttttcta ctttcttctg ttagcaaagt 61740 

ccatccgcag attactaggt taaagatttt 61800 

tcagtgcttt cgggctacgc acttgtttac 61860 

agggtcacag agaagacctt caattatcaa 61920 

aaggaatagg gtacactgtt tttttcttaa 61980 

gactttctgt gtctttctct ttgactttcc 62040 

ctctgcctct ttctttctct ctctctctcc 62100 

ctctctctct ctctgctggt ctttccttgc 62160 

caaccactgt gagtgggtgg ggtctaaaac 62220 

ctggtctggg tgccctggct tacaagttac 62280 

gtccaggctt ccttctgatg gccaacccac 62340 

agttctagtt tttcctggtg tcatagtaac 62400 

atttttcaac atagctccta gtggtcttac 62460 

aaacaccatg ctcacactgc aaaacaaaaa 6252 0 

agtttacacc aaaccagaat caaaaccaaa 62580 

aaaaccaaaa ccaaagtatc aagcaatcca 62640 

ggtacaggca caccatgggt gatcaggcca 62700 

caaagaccag tcttaccaag ttttagatgt 62760 

gttcagccac tgcattgatc ctccacaggg 62820 

ccactggggc aattgcctac ctgggagtgc 62880 

gagtccccct caggtatgct ccacagggca 62940 

catccattaa tcaccttgtt tcctggtcag 63000 

agacaaaacc cctcagacac cgagttaaag 63060 

ggcaagactc acatctctaa aaaccaagct 63120 

gggcctacaa ctctaagggg gtccccatga 63180 

gtatgtgact gggggctgca tgcaccagta 63240 
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atcagaacag aacagaacag gaaagggatt 
atctatagat aacataaccg gttaggtcag 

5 

gcacagggct gtctgcctgt agatttcatt 
ttgggggcag aaactgtgca taagacaata 
10 gcaaacatgg aacagaaagt aggatggtgg 
ggtggttgtt caatgggtat agagtttcag 
gttacacaac aatgtgaaca aaattaatac 

15 

ataatgaatt atatgtgttt tgtaccacaa 
ttgatttaaa gaaaatattg ttctgtttct 
20 aaaccctttt gctcctaggt cagtagctct 
ctccatcagg agacatttga caatatcaga 
gattgtgctg gcataggcca gggagtctgc 

25 

ctactgtaaa gaattaccca gtccaaagtg 
cagatcatta atcagaacgt gttattgata 
30 tagaattcag actctggaaa aaactgggaa 
gacaagagaa aggagaattg ttttaagctt 
gggaggattg gggggtgagg aagacagtca 

35 

ggttctgtgc taggaacctt acacgtgttg 
ttttaaatta aatagcacag agaggaaaac 
40 gcaactttaa ctggtttatt agcacattcc 
gaagagacca aagagactgg gctccttttt 
actgtttcac agatgctgtg agggtgacaa 

45 

ggcactctgc gaaagaggaa ggggcgccag 
ctgcccctct ccccaggctt tctgtgtgag 
50 acttaagaca agccaattac atttcataat 
ccagccttct aaatggtgct tcttaactcc 
ttaagtgcag aatccatggc cctctgtggt 

55 

ttcattctgg aagctccccc aaggaaagga 
attgccctga ctggctctgg tttccttgcc 
60 ccacccaagg ttttgaaggg gcacagtgcc 
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ttcacagtgc 


ttttccatac 


aatgtctgga 


63300 


gggtcaatct 


ttaacgaggc 


ccaaggtgca 


63360 


tctgcctttt 


agtttttact 


tcttctttct 


63420 


tgaggggtgg 


tctcctccct 


tagagacaga 


63480 


ttactagggg 


ctaggaatag 


gaaggaaagg 


63540 


ttctgcaaga 


tgaaaaagtt 


ctagaggtct 


63600 


tagtggtctg 


tacactaaaa 


agtggttaaa 


63660 


tttaaaattt 


taaaaataaa 


agttgagaaa 


63720 


agcacacctg 


cctttacaca 


gaacacacaa 


63780 


caacatgaat 


gtgaggggat 


tttgttctcc 


63840 


tactgtttct 


ggttgttaca 


catagggtgg 


63900 


taagcatcct 


acagtgcaca 


ggatagccac 


63960 


gcattagtag 


cttttttgag 


aaacctgtcc 


64020 


taccagtctt 


ctcccttgta 


gttaggttgg 


64080 


tgtaaaaggc 


tgaatgagtg 


taagctgaaa 


64140 


actaaagcca 


aaatggaaaa 


aaaaagaggg 


64200 


acagaccgct 


tttaggccat 


cattccgaag 


64260 


tctcacataa 


tccttaaaac 


tcagaaatga 


64320 


tgactggaga 


gggagccttt 


gcttaggtag 


64380 


cctgaaggga 


aagggggtgt 


gattgctcct 


64440 


aatcaaagct 


caggaggaga 


gctgcattcc 


64500 


agatgcaggg 


cacccactgg 


aaacacagac 


64560 


gagcttgggt 


gagcaaggtt 


99 a 99tgatt 


64620 


tccattcctc 


ctctcagatt 


tattgttaga 


64680 


gtctgtgtca 


ttcagacgct 


gagaactaat 


64740 


tttcttcaac 


aggagagtta 


atgtgtggaa 


64800 


gacggtgatg 


gttcaggttg 


tgtttctggg 


64860 


ggaaggaagc 


ttgcagggtg 


gggctttgcc 


64920 


tgatacattg 


aagtcagctt 


ccaaagaatg 


64980 


tctgtcctca 


caaaaccagc 


ctgcatatga 


65040 
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gtttactgaa ttcatttagg tcctgctgaa 

tatttcaatg cttccctgga ggggcccaac 

5 gaaaaagagc agccagagga ggaagatgac 

attgaaaggt tgctttcagc ctgccaagac 

aacaaacaaa caaactatat atataaaaaa 

10 

attactatta gctatatatt aatgaatgtt 

tattattatt atggttagtt atagacagtg 

15 ctagcactat tactgttatt agttacatgt 

ttataaatat ttttgctcat ctttatgaca 

taacagatta gaaaactgaa gcttcaagaa 

20 

gtctcgaagt ggcaattcac acataaggct 

ttggctcttt tactgacctt gctagaagtt 

25 aaatctacaa tctgtgagta tcacatcctg 

ctgatgactt tcagaagatg aaggtaagta 

ttaaaacctc tttggagctt ctgaggactc 

30 

gctggaagac ttttctcatc tcactattcc 

aacttgaaac tccttgacag cagttaagag 

35 aaatggaagg agtatctgtt tttaactctg 

gtgagtcact tacctaaggt tgtgtaatta 

tgtttagacc atttgtaaac tgggcggtct 

40 

tccaagctga gcttagcaac aaataccaca 

gaggaccaac gacgcgagcc tatgtgttat 

45 tgcagctgtt tctccagcct gagtaaactc 

ttaggaagag gtagaatgat ctagagattt 

agatccaaga ctggttttac agcttcccgg 

50 

actgcaaata tttcttcagc attaactctg 

ttgcattcat aacttaaact ttattttgca 

55 ttactaacat gcagagctgt tgtaaggatt 

tgtctggtaa aatatgataa acatatctga 

ataagtaagc agagttcatt tattagtctg 

60 

gttgtcagta ggactagagg aaagggaggt 
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tctttcccat 


ttgttccttc 


tctgtttcaa 


65100 


ctttctggag 


ggcaaagatc 


tgttgttaga 


65160 


ttgagggtag 


gcagcagttg 


ggcaatcagt 


65220 


ttggggaatt 


agaaaagcaa 


gataaaacaa 


65280 


caatttttta 


aaagtttttt 


tatgttctat 


65340 


cctttaaagc 


agtgtagtca 


ttattgctat 


65400 


gaatattgtt 


ctttctctat 


attatgatta 


65460 


tattgaaagc 


ttcaaagcag 


cataggcttt 


65520 


attctccagt 


gttggtattg 


ctcctctatt 


65580 


cagacttgcc 


taacaacagg 


aaacttgtat 


65640 


ccatgactcc 


tgaactctca 


caaatattag 


65700 


tacagncacg 


gaagtgcagg 


aacatttcac 


65760 


tatagctgta 


aacactggaa 


taaggaaggg 


65820 


gaaaccgttg 


atgggactga 


gaaaccagag 


65880 


agctggaacc 


aacgggcaca 


ggtaggtaat 


65940 


cttgccgtga 


cctcattaga 


ggagtaattt 


66000 


acactagggc 


cttttggaaa 


tagagtgggc 


66060 


tgtccgctgt 


attctaattg 


cataaactca 


66120 


atgggagcag 


agctcttgtc 


tcttcaaatg 


66180 


gtgctttgag 


aatggtgccg 


tgaaaaggga 


66240 


acctacttgg 


tctttatgct 


aaaaagtggg 


66300 


gagtccaaga 


cttgttctca 


taccactgca 


66360 


agacacaggc 


tggcaatgcc 


aggtgggaaa 


66420 


atgcaaaggc 


ttcagactcc 


aggaaagctg 


66480 


ctgtgtgaac 


cagggtgaac 


aataagggta 


66540 


ggccacacat 


atatgcaata 


ggtatattat 


66600 


ttttcttatt 


tattacatgt 


ttttactaac 


66660 


aaaanrart'a 
□ aaciy a v L. y 




attaaaaata 


6672 0 


gtgctaataa 


atagtattat 


taaacaagaa 


66780 


aaaattcatc 


aaatatatgg 


caatacagtt 


66840 


tgttgcarat 


tgcaaatttt 


gtgtgaccat 


66900 
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ttgcctctgg acattgagac ctctcaagat 
ttatctataa tggactgtgt caaacattat 

5 

ggttttcact gttccatact atcagtttct 
attggcaaag actgagttac tgtttttgga 
10 gagctaaatc tattttttct tacatagttg 
ccaatgagac catcatagtg ctcccatcaa 
ccgaacccac caaccagggg caggatagcc 

15 

ttattggggt aaatctaatt cagaacgtgt 
tgatatatgt cttgggactg gacatctgtc 
20 agggacttta gggtagaagc cacttggaga 
tgtcttaaaa atatattttc ccttattctg 
tttgtatttt tacaatacag tgcttttcta 

25 

ataatgacat attgtgtgta tgtgtatata 
tgactttttg caaataattt ttccctcact 
30 cctcttttct cctaacgttt ttgagaatgg 
ctcctctcac ccagaattta ttaaagaagc 
ggatctatct tggattggga gggaaacaca 

35 

ttatgtgatc tgaggtgagg cactctgttt 
ttccttgcct catgtacaat cacatgggtt 
40 aaaacttttt ggtaaatgat aatagagctg 
aactacctcc taagacagag cttcagttat 
ccctgtgtta gtctgctaag ggggatggcc 

45 

aacagaacaa tgcacactga ggcctttcag 
aggaaaaata actaatggat actaggctta 
50 aaacccccat gacacaagtt tatctatgta 
aataaaagct aaaaataaaa gagcttccag 
ttccgaggaa gtgaccaaag gggcaaactc 

55 

agggccttct gcactgccaa tactgcagag 
tcagagagat agaaagagag ttaactagtc 
60 gtaaatctca gcacttgctg tgagctggaa 



tcatgttgct gccaccctgt gatgtgttta 66960 

ggaaaaatcc tcatgaaacc aagattgtgt 67020 

ttctctaatg gacaggccaa gcctagacta 67080 

gtcagaacct gggagctctg tactttttaa 67140 

gcaacaccat catgacatca caacctgttc 67200 

atgtcatcaa cttctcccaa gcagagaaac 67260 

tgaagaaaca tctacacgca gaaattaaag 67320 

tggagagggg ttgggggaag tgccaagaga 673 8 0 

gtgagtgtgg agaccctaaa attttgctag 67440 

aaactgtccc agaacttttc cacaagaggt 67500 

aatatgagga attgattttc tattgttgtc 67560 

tattttcttt tacaaaagta atggtggtaa 67620 

ttccacaact gcatgcttaa taatccaaac 67680 

gcaaattaga ggaaatatac aactcctttt 67740 

aaatgatatt caccttttgt ttgtctgttt 67800 

tgtattgcat gaagggagga gagaaagaag 67860 

gagttttggt ttctagaatt gcctctgcac 6792 0 

gactggattt caggacctgg tgggactggt 67980 

tctatgaggc tcatgtgaca taaggtctag 68040 

tgaatctctg aggtggatgt caggaaggaa 68100 

acatgagaac atgcagtatt tggtttctgt 68160 

tctaaatgat gagaacacat ggacacgtag 68220 

agggtggagg gtgggaggag agagaggatc 68280 

atacctgggt gatgaaataa tctgtacagc 68340 

acagacatgc agttgtaccc ttgaacttaa 68400 

agcttcagct atagggttgt gaactgcagt 68460 

catgtccagt ggagtctggg agatgagggg 68520 

aatgtggggg ctttcagatg tcccctgttt 68580 

atggacttag aggtcctgag agctgggaaa 68640 

gcaagtctca tgagcagacc agggcctgag 68700 
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aggcagaaat gcagaagcct tcctgagcac 
ggcagagctg cacaccctcc aggatgtctt 
5 ggtttgtatc ctcattagga tcttgcagaa 
caaaaaggcc ctgtctttac aaatgctggt 
cagactatcc agatcttgtg tggcatgatg 

10 

gcttccttct ctccaaattt tacccaagtg 
ttcataggac ccttttttgt gagtagagtt 
15 gaggaagatg ccaatagctt agacttccca 
gagcaaggag tcaacggtga atcttgttct 
attttagatc aatttcttat cagactcaaa 

20 

ttcacattat catcttgagc tatgatgaag 
aaaaaatcca tgaattagga taaagttggg 
25 gcataaatgg aagacatcac acactcacag 
gtggggatca tggaggaagg agaacttaca 
cattgactga ttcccaggta aggctgagca 

30 

tcctctgaaa tagctgcaag tctggagttt 
aactgctttc tctaagatgg gggctttctc 
35 ttccttcaac ttcctgcaat tcagtagcta 
caccaagaag tccacctttt tgtgtatctt 
tacacttgga tttggatttc attggttttc 

40 

atgccaagct actacattac actacattac 
taggctgtct aagattttcc ttttccttga 
45 gagattttta tagaatgttc ctaaatttgg 
agtatggtga tgaatttttg ggtggtgatg 
tatcaagggt catgctacca acataatgtc 

50 

ggggtagtat tttccagatt tctcctctgt 
ttttatgtcc ttccctagag aacatcttac 
55 aaaaattatt tcctggcttt taaggtcttt 
aggatatatt tctgctatct tttttttttc 
ggacaaatct atgtatttaa atggaactta 

60 

tcccatgaga gttgtctata aacattgtga 
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ctgccagaca cctgagactg tggcttctgt 68760 

ttcatatttc tcactccaga gtttccacag 68820 

ggcgataatg cattagtgcc atgggagcgg 68880 

tttcatcctc aatctcactt tctcttccca 68940 

gtattgagct tggggatcat tttggcatct 69000 

acttctacac tgttgaactc tgcttaccca 69060 

tctgaggagg gcaggatggg gcaaagaggg 69120 

cctgccagct tgctatgttt gatctgccag 69180 

cctgtcgggt aggatgacag gggttgcttg 6924 0 

taaacatttc ttttgaagat catcttattc 693 00 

ctagtgactt ctctccaggt ttaggcgaaa 693 60 

aaggaacatt ttatacaaaa aaaaaaaaga 694 2 0 

agaaaccaca ctgttatccc ccccatgctt 69480 

cctctgagca ttttgaaaat acttttactg 69540 

gatatctatg cccagtgatt gacaggaaat 69600 

caaaaggact tagtagtcct ttcctcagag 69660 

ataggtaagt ttccctttca tagatgtttt 69720 

tttgtttttc ttacatttgt gtatttgatg 69780 

gtccattttt cttgtcttat taactaaagt 6984 0 

cccaaatgac ctttttctgt tccaagatcc 69900 

attttgtcat gatgtctcct cagggtcctc 69960 

gaactttgat agttttgggt agcgctggct 70020 

gtttgtctga tttttttttt tgatgtttag 70080 

atgaggtaaa gcaccatttt catcaagtcg 70140 

actgacacaa ttaaccttga ttatgtggtt 70200 

aaaattacta cttttttttt taaccattgc 70260 

accaaaactg tctcatttta ttaaaaaaac 70320 

ttttgtttag tgggatattt tttgtttatt 70380 

tgatgttatc tgaatgttat atggacaaag 70440 

cctcattgat cactggtttc tcagggattt 70500 

tagtgacgtc gaaggtactc tgtacatgga 70560 
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atctgacctg 


accattttct 


cttttatctt 


agtttatcat 


ctctggctct 


ctatcaatcg 


70620 


5 


ccacagagaa 


aaggttaacc 


aagcttttgg 


tgagtaagag 


agctataacc 


cacaacccaa 


70680 


tggatcaaaa 


taaaacttag 


acagaactct 


ttaaggctag 


tctttctact 


ttgaaaagtt 


70740 




gaaaactcag 


ttccagagca 


gttggttgtt 


gttagaatta 


ggagctgggc 


ctggatcctc 


70800 


10 


tgtatccttg 


ttcagagaac 


atttctccaa 


ttattgtacc 


ctacatcaaa 


gtctgcattc 


70860 




aggggattat 


aatattccct 


ctgcccatgc 


cgaagaatgt 


atcacagaga 


aattgtgcct 


70920 


15 


gtttatgagg 


ttctttcggt 


gataactggc 


cttcaaattc 


aggttttcag 


tggcaaggaa 


70980 


gctgacagtg 


ttataaagcg 


gtctattggt 


tggggtccat 


tctttaagcc 


caggtgttac 


71040 




aacccttgaa 


aaaaaaatga 


gtcaaagtgt 


tgttcatgtg 


aggtatccaa 


agtagacaca 


71100 


20 


gaggctacta 


cagtatacta 


cattacattt 


aggcctgatg 


tctcctcagg 


ttcctttaga 


71160 




ctttctcaga 


ttttcctttt 


ccttgaggac 


ttcaatagtt 


ttgggtagtg 


ctggctgact 


71220 


25 


gtatcctttc 


atctatctca 


ccagaagtat 


aatactttta 


ttttgtttga 


gtataaattc 


71280 


ttgcacccta 


aaaagttgtc 


cttagtcatt 


tgtattggct 


aacaaaaaaa 


caaaacaaca 


71340 




acaacaaaaa 


acaaagcttt 


acccgtcttt 


atcccttatt 


ccagcaaaaa 


ctagagttgg 


71400 


30 


aagtggcagg 


gagacaaagc 


ctggattata 


gggagaatct 


tctcttgtct 


ttaaagtttc 


71460 




attaagtctt 


cgctcaatcc 


attatctctc 


aaggggttga 


tgttggagat 


tatgtgagag 


71520 


35 


aatgtcatac 


catgcaattg 


caccgagact 


caatttggaa 


gctctggcta 


caaaaatctc 


71580 


ccaaagccag 


caaggaagtg 


agaaacggca 


tccagaaaaa 


tgccaattct 


ttttctcatc 


71640 




ggtatttaga 


gattattatt 


tggaaccctc 


agttagactc 


ggaactacta 


actagatctg 


71700 


40 


tgctgtttaa 


tttaactttc 


tttgaggatg 


gaaatgaaat 


gtaactttct 


atgatgatag 


71760 




aaatgaaatg 


taactttcta 


tgatgatgga 


aatgcactac 


agctgtactg 


tacaatatgg 


71820 


45 


cagtcacatg 


tggctatcga 


tgatgtgaaa 


tatagtgcaa 


ctgaataaaa 


atttctctta 


71880 


attttacttt 


aaatatagat 


agtctatgtg 


gctattggca 


accatactga 


gtagcctggc 


71940 




acaagggtag 


tgctttcaaa 


cttttaggag 


gaaatatatg 


ggcctgttac 


tccagagaac 


72000 


50 


aatcccacag 


tgaacatgca 


ggaagctcaa 


ggatgcagga 


tgtggtctcg 


gggtgggctc 


72060 




agacttacca 


gagacatcga 


ggctttagcg 


agttctactc 


ttgtcacaca 


ttgcaccagc 


72120 


55 


cctttttgag 


attgaagaag 


agtgccatct 


caagacaaat 


ctagcttatt 


ctaagaaata 


72180 


tagactctag 


gcatcaagaa 


tactactggc 


tactaggagg 


cccaggagtg 


tacacttcag 


72240 




tcattatgtt 


gggaaattgg 


ggtcccagag 


tatcaactgg 


tgagttccta 


aaagaagaag 


72300 


60 


tgcccatgtg 


ggattaagta 


acattctact 


gtatggctgt 


gaggtgtgcc 


ctggagccta 


72360 



-186- 



WO 02/062946 

cctacctgta actatagctc tcagagtact 
ggagtacaaa agaagtcact tctgaggtta 
5 ataaatgaga accagagtgg gaaataaaat 
aaggcaagtc tagtactgtg acttcaatat 
acttcccgct gcctcccaca cattacacca 

10 

cagccacccc atactgcact cagtgtggcc 
gcctgagact cagaaccacc ccctcactca 
15 atgggcaggc gagagtgagc tattaggttg 
aaccacaatt acttttgcac catgctaata 
ttccgccagg ggtttttgag ttgagcgaga 

20 

cactttgggt cacgctctct tggtaatgtt 
ctgtcctggg gaaagaattt aggagagtaa 
25 taaggtgaac cagttcgtat agtcattcat 
atttttgctg gacactggag aggcatacag 
gggacagata agggaataac ttacaattta 

30 

cattagcaat gatccttacc caacatttct 
cattctgagt gctctgtctg ccctggtggg 
35 cttaaatcct gcctcactgc agtgtgagtt 
tgtttcttac ttttatcatg attcacttta 
tctggctgta agtattttta gatgggatgt 

40 

atttcttgtt attccttctt ttgaaaatct 
gaaagccagg tttatgtaaa tcaaagggga 
45 ctaatagagt ggaattgaaa atgttaaata 
tttgtagagg ttcttaatta tcaaactaat 
tcaaaatgag aatcttggta catggagtgg 

50 

cccaggagat cagaataggc caacaaggaa 
tggtttgaga agacatcaga ttcagtacta 
55 aagataaggg cttcttcttc taatgatcac 
gcaacattat tgttgctttt tgttatagct 
taagaaatat agggagatgg agagtgtaat 

60 

ttggaagaag aaaaataatg aagtttggct 
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cttagctcaa ttctcttttc tgtgtgagtt 72420 
gtttatctat tcccttcact tcataaatgt 72480 
caaggaactg attacccaag ggaataaacc 72 54 0 
tctgacctcc agttcagacc tttttcttgt 72600 
tccttgggcc tgcacctaga cctctgccca 72660 
aatcatatgc ctgcccatgt tccagggcct 7272 0 
tctttgagag gtgtgctttt gagagaagca 72780 
gtgcaaacat aattgtggtt ttatggcaaa 72840 
catgatctat cttttccatg tggctgatgc 72900 
gagaaactgc atctggtacg ttgccacctt 72960 
gcaggtaaag cctcatgaaa gccctgctat 73020 
cattctctac cacaccactg aggtcctgct 73080 
tcagccaaga attaccctgg agattgagag 73140 
caataaatga tcaatcccag acctcctgag 73200 
atggcaaaga tgcataacag attcatgagt 73260 
ctttcaggtg catagcagcc tggttggaag 73320 
tttcattatc ctgtctgtca aacaggccac 73380 
ggacaaaaat aatataccaa caagaagtta 7344 0 
taccacggac tgctatacag ccaaagccag 73500 
tctaatctta tgaggtattc aaaagccttg 73560 
ctctattggt tctggtttgg gcatctgggg 7362 0 
ctacagggat gagattaagg ggattacata 73680 
aggttgatga tgaaaaacta aaacttactg 73740 
ccaattttgg aggcataaat tatgagttta 73800 
tgaattaaga atggaaacag agcagaaaga 73 860 
acagtatgtg tgcatggtaa ggagttaatt 73920 
agaaaaagag gataggaaat atgggatgag 73980 
aataatgaat aatactgatc aggtgagaat 74 040 
gaaagaagtg tgatatatta aatcaggaga 74100 
tgagaatgat tgcaggagaa ttctatctct 74160 
aatagaaata atatgtctga agttgaaaga 74220 
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ttaaggggca tgtgcatggt cagcaacaca 
agggacggct gggtcagtgg ctcaccccta 

5 

atggatcacg aggtcaggtg ttcaagatga 
actaaaagta caaaaactag ctgggtgtgg 
10 gaggctgagg cggagaatta cctgagcctg 
tcacgccact gcactccagc ctgggtgaca 
aaggaataaa gaaattgcag aatatttgtt 

15 

agaactcagc tggaagatgg acagatagaa 
ctcattataa agagggcctt gctcccaagg 
20 atgaggctga cagcaattgg cacagattaa 
aaattctgaa gagaaatttc agagaaattt 
acgattttac attggaaata agagagattc 

25 

gtaatggaaa aagttatatc ttctctttaa 
atgtgatcaa attatttccg tttattatca 
30 ggacagaaat gatagaattc attcttgtga 
tttactacct tcactgtttt tttctaagcc 
aaagaaaaaa caacaaagaa acaacaacaa 

35 

tacatagaca gggatgtcaa tactaaattt 
tgatccttac aaatttctga gaaagaaatg 
40 aggagctaga gaaagagaga gacttgtcca 
agactggaac ggagctgctg attctcagtc 
accgagtgaa caatgtcatc agaactatgg 

45 

gaagtagatt agttgattta attactattc 
taaggggttg aattaagaca attgtgaggt 
50 tgtcctttct tcattctcta atggttgagg 
taggcatgaa ggcttcagct gacctaagca 
gaactctctc tctgatgctg atttgcactc 

55 

ctgtgctgcg gtggaaacag gcttactctg 
tcacttaacc ttgcctagtg tatcttatcc 
60 ggtagaagga acaatcagtc agtcatgaga 



ttgagagaaa aacaggcaac ataagagtaa 74280 

taatacaagc attttgggag gccctggtgc 74340 

gcctggccat gacggtgaaa ccccatctct 74400 

tggcgggcag ccgtaatccc agctacttgg 74460 

ggaggtggcg gaggttgcag tgagctgaga 74520 

gagcaagact ccgtctcaaa aaaaaaaaaa 74580 

gtcctgcaat cataaaaaca agataagcca 74640 

aggtctggtc aaagtcatgt ttgtatagca 74700 

taatggtata atgagaaaaa gtacaagtaa 74760 

tacctggcac agccaattcc accttgggaa 74 820 

ggtagtgata aagtataaat tcatgatggg 74880 

tgtcaagttt gcttgctgga acaatttcca 74940 

ctgtgtataa ataccacaaa ccaaagtttc 75000 

gatggccagt tctgcacaaa gaatgtctca 75060 

gttggtgttg gtgattagca actctgacag 75120 

ttacagttac acccccaaaa aaaagaaaaa 75180 

aaatcaagaa gaaatacaaa ggggttcaga 75240 

tacaaaggac ttgcacagtc aaaacctcct 75300 

ttagtatgca tgtttgatac atgaggaaac 75360 

ggcttacaca gctactaagc agtaaaggtg 75420 

ctgggagtta ttacgccact ctgctttctt 75480 

atgtctcaaa gaagtaagaa ataagtgtat 75540 

atctcccttt ctcctctcaa ggaaatgagt 75600 

gtgggaggag cagttttatc ttcctgatgg 75660 

tggtcttcat ctctatggtc acctctcaac 75720 

acatcaatct tttatttctt gactttcagg 75780 

tgctggaatt ctgcctagct gtgctcactg 75840 

acttccctgg ggtgagtgtg ctggccggct 75900 

ctgcactgtg ttgagtatgt caccaagagt 75960 

tacacatggg agggcatttg cattgtgatg 76020 
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gaagacagag aagaaaagca gatggcaatt 
tatgaaaata gttaatcatg agaaatcaac 
5 ggaatgagac tctgggaagt taaatgactg 
tgctgaggac actagaacct ggcttgcctc 
ccactagtct ctttaatagt attgacttgg 

10 

agtgaatgtc ttatgctctg catttgcccc 
aagctatatt aaaaatgtat tgttgaatca 

15 tgaattaaaa agccttgatt tatccaaaat 
gtgataaatt gcaattattt gaagaacctg 
aagaattcta tgtatacagt tggaaaaatg 
aagtctgaag acctaacata ttttgtttct 
acattggtaa ttctggcatg tcctcaaaaa 

25 tgacttctta agaaaaaagg gagaaatatt 
gaaaagttaa ccattataga aaagcaaagc 
aatgaacatt aaaaaaaacc attatttcac 

30 

ctcttgattt gcctgacatt gacttcagca 
agattagcca aatagtctga atttccagaa 
35 tttcccagaa aatgtttccc agaaaactag 
agcaaaggac acaacttttt attgtacccc 
tgtggtgctc agactttttt atacaggtgc 

40 

tagtgcataa aggcaagccc catataatga 
caaaataaaa tttttgagtg catattttgt 
45 aaacacagaa taagtcaaca atgacagaca 
catcaaatat tcaaacaaaa ctaatgagca 
tatgaagcat cgaccagttc atacttattt 

50 

tataaatatt aagtttcagc atgtgtgtac 
tacatgagct ccatttcatt gaaagaactg 
55 gcaggggtca tacccactgt atcagatatc 
tccaggctga aactggactg tcagattctt 
cattatgtct tctccctgtc cattgcaccc 

60 

tttacctact gtggacattc aacttttatt 



PCT/US01/48437 



gagtagctga 


taagctgaaa 


attcactgga 


76080 


tgagtcaatc 


ttcctatttt 


gtcagcgaag 


76140 


gcctggcatt 


atgctatgag 


tgtgtgcctt 


76200 


ccttataagc 


agaaacaatt 


tctgccacaa 


76260 


taaagggcat 


ttacacacgt 


aactggatcc 


76320 


tggtgatctt 


aaaattcgtt 


tgccttttta 


76380 


aacccctatg 


gacttatggc 


tttatttaac 


76440 


tgtattatag 


agtgtagaat 


gaatactagg 


76500 


gtgatatgct 


ctacttatct 


tggattagct 


76560 


gcatatatac 


atctatcttg 


aacctgattg 


76620 


tctagagtgt 


acttttcctg 


cctcacagtt 


76680 


tgactcatga 


ctgtggatat 


gaagaactat 


76740 


aatcagaaag 


ttgattctta 


tgataatatg 


76800 


ttgagtttcc 


taaatgtaag 


cttttaaagt 


76860 


tgtcatttaa 


gatatgtgtt 


cattggggat 


76920 


aaagcacggg 


gctgtaaatt 


accatttact 


76980 


aacaaggcag 


aatgatcatt 


cccagaaaca 


77040 


acagcatgat 


cattcaatgg 


atcacagtga 


77100 


ttaattgtca 


acaggagtta 


actgatttgt 


77160 


tagtgtttta 


tcctatgtat 


tttaactcat 


77220 


agtctcaggg 


tatatgaaag 


tagctggctt 


77280 


tttaacactc 


gttttgttgt 


tttttccctt 


77340 


taaacccaaa 


ttggaactcc 


attttctccc 


77400 


aacaaaagct 


aaccaggaat 


actcattttt 


77460 


tcttattgca 


agaaagacca 


attgtttgtg 


77520 


acacacacac 


gtagaaaagt 


atttcaattt 


77580 


gtcatgtttc 


ccttgctttt 


acccactgta 


77640 


LiydLdL Ltd 






/ / / vU 


tgattttagc 


tttatcttca 


aactctcgtt 


77760 


cccactgaca 


cacaccattt 


cttttattta 


77820 


gacgcttatt 


ggataccaga 


cactatgtca 


77880 
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15 



25 



aataagagaa acagagatga cagtgaaatt agggatatat acagggcact aactattatt 77940 

tgaacacaga ataaagcata ccttcttctg cctaaaagaa tgtaggggat aaacaggtgt 78000 

5 

gtgtgtgtgt gtgtttgtgt gtgtgtatgt gtatgtgtga gagagagtcc ttcttttgaa 78060 

aatccacatc ccccatatca atcacatggt gccaccttct gccacaggag aggacaatga 78120 

10 ctcagagtcc acaattggtg tattcatcta ctgtactcac actccagctt tgttcctcac 78180 

cccctgccac catgtcttca ccaccaccac agcactctgg ataccagaag tcttactgat 78240 

tacccagtca aacttctggt cttcaaagtt ccccccgggt ctctcttcag ctttgacttg 78300 

tttatcagag aaggcatcca gccctttagg caacactcaa agccactttt tatcctaggc 78360 

tcaacctcac tgccatgtca tccactgata tcttttcttt caagtagcaa tgccccttca 78420 

20 gcccaagtca agtgagaatg atgcctacct tgagatagaa gatgtattga gataaccttt 784 80 

gtaaagcagt taccaatata gctgacacat gaaagatgct cagtgaactc gagttttatt 78540 

gtgtgtttac ttccctttgc tttcacttcc atagtccaac tttccttccc cttccacttc 78600 

caccctctat gtctaaaatt aacttcaacc ccaatacatc gctgggattt ggaacctgat 78660 

tatactcatg aaatctttca tttcttcaag aaagtctggg ccaggcacag tggctcttgc 78720 

30 ctgtaacccc agcactttgg gcggttgagg cgggcagatt gcttgagttc aggagtttga 78780 

gacaagcctg gagaccatga ccaaacccta tctctataaa aaaaaaaaaa aaaaattagc 78840 

caggtgtggt gacacatgcc tgtagtccca gctactcaag aggctgaggc aggaaaatca 78900 

cctgaaactg agaggttgag gctcattgag ccatgatctt gccactgcac tctagcttgg 78960 

tggacagagc aagaccttga aaaaacaaaa aaaaaggaaa gagaggaaaa aaaagagaga 7902 0 

40 aagaaagttg tagaatgctc taaataggat gactcagact gtgtaggaaa catgttttct 79080 

cttccctttg caggagaaag attttagaat tattggcctt tcccaatttc tgcacagttg 79140 

actctactga caccttatgg tgataacgag gaacaacttt tctcccaaga gaaatagaaa 79200 

aaggcaaaac aaggtatctg gaattcactg gaggtatcta acttgaccac aggaaatcac 79260 

acttgctacc tttgtccttt acagtgtctc cacatgtcat cagtgagggg aaactatgca 79320 

50 tttttcaaaa gttatttatt attgtaagga aagtggctgt gcttcagtca ggagtaggcc 79380 

aaggtaaaca tccggtatgg tatgacacag cgggtttgga gcgcaggtgc acaaccccat 7944 0 

gcattatgta accatttact ataatctgtt tgtgtgagct catacctggc tttgagccac 79500 

tctgtctgtg agtaatataa ctgcactgct gactctgtag gacaggagag agaataaagc 79560 

cacgttccaa ctgcctacaa tccaagtgtt ctttcagcta cccaccacgt gtccaccagc 79620 

60 tcccttcaga acccagctca ggttggaacc tgtcaattgg catagtcagt aggatcccaa 79680 



35 



45 



55 
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gatgagtgag cccttggcct caatgatccc 
tggtacctga tggtagctgt gctgctagga 
5 gacaggtctc ctgtgagcat ggagaaggca 
aggacaatac ctttgctggc atagttcaat 
acactcagtc cctgcaggat gcagcaaagg 

10 

agaggcccag acacacagct tggaatgaga 
cccaccatcc cagtcagaga ctcctgctca 
15 gcgagtttgc ccagtggtcc atcagaaaat 
ttgggcttgg ggacccccta ccatgacaga 
tatattgtgg gaattagata aacagtgtca 

20 

gatgcctcat ctttgggatg agggagccaa 
gaagttggct tccattatga ctcatccctc 
25 gttagcacag ggcaaggcga ccacacccta 
atatggaaag atgccaaaga aataccagaa 
ttagtgcatg taatttggga gatgggtgtg 

30 

gccaaatgat gaacacttta cctcccacat 
gagtgccttt ggttccctgg ccactgtcct 
35 tgactacttc catggcggcc ctcagggaag 
gtgccataaa aaaggggaaa gtttccctac 
atccccagta ggtgacctgc acacagatgt 

40 

gaaagaaaat tgataagcaa ctccatgaag 
cagagcagca attctggaaa atgcctaaga 
45 gagtgtccca cccagacact ccagctcaag 
cctttttttt ttttttcgag agagacggag 
tggcgtgatc ttggctcact gcaagctctg 

50 

agcctcccga gtagctggga ctacaggcat 
ttttagtaga gacggggttt caccgtgtta 
55 gatccgccca tctcagcctc ccaaagtgct 
gctgttcttg attatttaaa aaatttaata 
atttaaaata cattttctta ttgctttcag 

60 

tatttctggg atgtgaaatt ggttcaacat 
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aggtcagcca 


tgtggcccca 


gcatgtgttg 


79740 


tgggccttgg 


tggaaacatg 


ggtggtggtg 


79800 


ctgaagtacc 


tgtaagcaca 


gagcaatgag 


79860 


gggcgttttt 


gactgtacta 


cggtaagtgc 


79920 


taggggacct 


ccaggcacaa 


acagaaaccc 


79980 


attagagtct 


gccattagtg 


tgggcttggg 


80040 


gtctgatact 


gaggaagaag 


aacatcagtt 


80100 


agagcaggaa 


cagtcattgg 


ggccccaggg 


80160 


gcacatgtct 


tacagtgcct 


ataccccaat 


80220 


gcagtgcttg 


ggggaacccc 


tacctgcctc 


80280 


cagcatcgtc 


tgctctgcct 


ctgaaatgga 


80340 


cctccgtcag 


tgattgcagg 


tgagcaggtg 


80400 


attgaatggc 


tgatggtggc 


catttgaacg 


80460 


actgtgagta 


aatggcagtc 


gtatctagat 


80520 


tggcaggctg 


ttccatctga 


ctacccaagg 


80580 


gagggacttt 


gtgttgggct 


ctgtgcccct 


80640 


cactctatgt 


ggggtgctgc 


atacataagg 


80700 


cagagggctg 


taggtgagac 


cagagagtct 


80760 


tccagggggc 


tacctcttgg 


gacaaaatgc 


80820 


agattgattt 


gattttggct 


ggggttgcgt 


80880 


tactgttaac 


tttgtggagg 


caattgtccc 


80940 


gggagaagga 


tgatgcatcc 


tgcagggact 


81000 


gactactaga 


agatgggtgg 


aggtataaag 


81060 


tctcactctg 


tcacccaggc 


tggagtgcaa 


81120 


cctcccgggt 


tcacaccatt 


ctccttcctc 


81180 


ctgccactgc 


gcccagctaa 


ttttttgtat 


81240 


gccaggatgg 


tctcaatctc 


ctgacctcgt 


81300 
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tctcaatacc 


aattatacta 


tttactgtac 


81420 


aagtttaatt 


ttattatttt 


tggtagggtt 


81480 


atacaaatca 


ataaatatga 


ttcaccacat 


81540 
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aaacagaatt 


aaaagcaaaa 


accatatgac 


taaaattcac 


catccttcat 


gttaaaaatc 


actcaaaata 


ataagagcca 


tctatgacaa 


actggcaaaa 


gcaggcagca 


ttccccctga 


catcacttct 


attcagcata 


gtactggaag 


aaataaaagg 


catccaaata 


gaaagagagg 


aacgtgattc 


tatacttata 


aaatcctaaa 


aacaacttca 


gtaaagtttc 


aagatacaaa 


tataccaata 


atgttcaagc 


tgaaagccaa 


caaaaagcat 


aacataccaa 


gagtagagct 


gaactacaaa 


acactgctga 


aagaaatcag 


atgctcatgg 


atagaaataa 


tcaatattgt 


ttaaaaaatt 


gtatatatcg 


atttttatca 


taacaaagtc 


tttttttctt 


atttgatgtg 



30 ctattgtcaa attatatatg aataactgga 
ttaaaaatac atcttcctat tctatttgcc 
tatatgaggg agagtgattt ataatttctc 

35 

tgtgcctgta acaaattacc acaagctttg 
gttctgtggg tcaggataca acactggtct 
40 actactttcc tctctggagc ttctagggaa 
cttgaggctg tctcattcat tgactcatgg 
gtccttctca tgttgcacct ctgactattc 

45 

tgggaaaggt tctccactta taaggactct 
caggataaac tctgtgttaa gatcaactgc 
50 aaaatcattt gccagatatc atagcatact 
ctttgggtac tattattctt ggtaacatac 
tagaagaaaa ataacaactg tgtcctggcc 

55 

catgtggtga ggcaatgagt cataaccaac 
ggtgtgacat gagctgatga aaaagaactt 
60 acaaatttgg aagttttaaa atggtaccct 



atctcaatag 


atgcagaaaa 


aggtttttga 


81600 


cctcaacaaa 


ttaggcatca 


aaggaacata 


81660 


acccacagcc 


aacagccagc 


atcatactga 


81720 


gaactggaaa 


aaggcatgga 


tgattattct 


81780 


ttctagccag 


agcagtcagg 


caagagaaag 


81840 


aaggaagtca 


aattgtctct 


cttcacagac 


81900 


gactccagca 


aaaggctcct 


agaaatcatt 


81960 


atcaatgtag 


aaccattact 


agcatttcta 


82020 


attaagaatg 


caatctcatt 


tacaatgcta 


82080 


aaccaagtag 


gtgaaagatc 


tttacaataa 


82140 


agacgacatg 


aacaaatgaa 


aaaacagttc 


82200 


taaaatggcc 


atacttctcc 


atgaatttct 


82260 


aaagctttgt 


ctgcatacct 


ctgttgagat 


82320 


tttctgcagt 


aagtgatttt 


cgaagatttt 


82380 


atgagttcaa 


tttggccatg 


accaaatgct 


82440 


acttttattt 


ttcagtttct 


tacatctata 


82500 


attatcacac 


tgcttatatt 


agttttctat 


82560 


tgacttaaaa 


caatacaatt 


ttatcttaga 


82620 


cactgggcta 


aaaatcaagt 


tatctacaga 


82680 


aaaaatccat 


gaccttgcct 


tttctagctc 


82740 


gctttttcaa 


ggctggcagt 


ggcagatcat 


82800 


ttctatcctc 


aattctctct 


tagacttacc 


82860 


tattattaca 


ttagaattgc 


cataattctc 


82920 


ttagcaaact 


tgattctatc 


tgtaactttt 


82980 


ctcagattcc 


agggagtagg 


atgtggataa 


83040 


tgcctttttt 


gatgttgtta 


ttgcagaaaa 


83100 


gacactggct 


gggagctcat 


tgtcagtaga 


83160 


gtgggtttct 


ttttgtccca 


tgaaagattt 


83220 


aattcaagta 


ataaatttat 


ttctattttc 


83280 


gtttcctggt 


tccttgagaa 


gttgtggggt 


83340 
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tgaaaggtgg gaaatcaaat gcataatcag catcagcatt acattaaaga gtacacgtgc 83400 

aggtttgtta catgggtaca ttacatgatg ctgaggcttg aggtcccaaa aatcccatca 83460 

cccaggcagt aagcatagta cccaacaggg gctcttcaac ctgttcccct ctccctccct 83520 

tccccatcga gtgagtggtc ctcaatgtct attgttccca tctttacatt catatgtatt 83 580 

caatatttag ctcccacttg ctagtgaaaa caatgcagta tttggttttc tgttcttccg 83640 

ttcatttgct taggataatg gcctccagtt ccatccatgt tgctgccaag gacaagcttt 83700 

cattgttttt tatgatcgca tagtattcca tgttttatat gtaccacaat tcctttatcc 83760 

15 aatccaccat ggacaggccc ctaggttgat tccatgtctt tgctatcatg agtagcactg 83820 

caatgaacat atgtgtgcat gtgtcttttt gatagaatga attattttcc tttgggaaaa 83880 

cccagtaaca ggattgctga gtcaaatggt agttgtgttt ttaggtcttc taggaatcac 8394 0 

cacactgctt tccacagtgg ctgaaccagt ttccattccc accaacagtg tgtaagtgtt 84000 

ctcttttctc cacagccttg tcggcatctg ttgttctgtg actttttagt gatagccttt 84060 

25 atgactggtg taagatggta tcttattgtg gttttgattt gcatttctct gatgattcat 84120 

gatgttgaac attttttcat atttgttggt tgcttgtatg tcttcttttg aaaagtgtct 84180 

gtgcatgtcc ttttcccatt tttaattgga tttttttttt tttttttttt ggggtttggg 84240 

gtaaaaat 84248 



20 



30 



35 



<210> 78 
<211> 38736 



<212> DNA 
40 <213> Homo Sapiens 



45 



<220> 

< 2 2 1 > genomi c_DNA 

<222> (1) . . (38736) 

50 <223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 

<220> 

55 <221> initial_coding_region 

<222> (7631) . . (7729) 

60 
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<220> 

<22l> coding__region 
5 <222> (10476) (10682) 



<220> 

10 

<221> coding_region 
<222> (12601) . . (12687) 

15 





<400> 78 
nccattcatt 


ccaccatgtg 


aggacacagt 


ttgtcctatc 


tggaagatgc 


agcaacaaga 


60 


20 


taccatcttg 


gaagcaagca 


acgtgaccct 


aagtagatgc 


tgaacctgtg gcaccttgat 


120 




cgtggacttc 


tcagttccaa 


aactgtgaga 


aataattttc 


tattttttat 


aattacccag 


180 


25 


tttgtggtat 


tctgttatag 


cagcacaaat 


ggactaagaa agaaactggt accaataagt 


240 


ggagtactgc 


tgtatcgaac 


acctaaaaat 


gtggaagtgg, cattggaact gggtaatggg 


300 




tagaggctag 


tctctaacag 


acgattctgg 


taaggactaa 


gaagaagaga 


gctgtagaca 


360 


30 


gaacctcagt 


cttagagatt 


attttagtgg 


ttgtaattac 


aatatcaatt 


aaaatgtggg 


420 




cagtgagtgc 


cattctgatg 


tggtgtcaga 


tgaaaatgag 


aaataattaa 


aaactggagg 


480 


35 


gggccaggcg 


cagtggttca 


atgcttgtaa 


tcccattcac 


tttgggaggc 


cgaggtgggc 


540 


agattactgg 


aggtcaggag 


ttcaagacca 


gtttggctga 


tatggtgaaa cctcgtctct 


600 




accaaaaata 


caaaaattag 


cctgtcatgg 


tggtgcacac 


ctataatccc 


agctactcgg 


66 0 


40 


gaggctgagg 


ctgtagaatt 


gcttgaacct 


gggaggcgga 


agttgccatg 


agccaagatc 


720 




gcaccactac 


actccagcct 


gggtgacaaa 


gtgagactcc 


atctcaaaaa 


aaaaaaaaaa 


780 


45 


aaaaaaaaag 


tggatcaaag 


gccatccttg 


ttatcaagtg gcaaagaata 


tggctgaatt 


840 


gtgtctatgt 


cccagggttt 


tgtggaaggc 


agaacttaaa ggcaatgaac 


taggatgttt 


900 




ggtgaaataa 


atatctaagt 


gaagtgttca 


gggaactgac 


tgcttataat 


aaaatacggg 


960 


50 


aagagagaaa 


tgaattaaaa 


gataaaatgt 


ataatcaaaa 


agtaagtgaa 


atttaaagat 


1020 




ttagaaaatt 


atcagcctgg 


ccatgtgaag 


ataaaaggct 


gttcaagaaa 


gaaaattaaa 


1080 


55 


ggtatggcca 


agctactgtt 


taaataagaa 


atatatatgt gtaggtagaa gccagatact 


1140 


attcactaga 


acagtggagg 


ggtgatccca 


gtgatatttg gaagatcttt ggggctccca 


1200 




ttcccattac 


aggtgccaag 


tgctaagttt 


ttattgacag atgaatggat aaactgtgag 


1260 


60 


atacacacac 


acacacacac 


acacacacac 


acactcatat 


aattcagact 


taaaaaagga 


1320 



-194- 



WO 02/062946 

gatcttgcca tttgccacaa catggaagat 
ccagacacag agacatattg cacaatctca 
5 acaactaaca atatagaatt cagagagata 
agaagagaga gagagagaaa actggtaggg 
aagataagta ggatgagcaa atctagagtt 

10 

aacaaggtat tgtatttggc atttttgcta 
catagaaata caaacgagtg actatttgaa 
15 ttattatact ttaagttctg ggatacatgt 
cacgtgccat ggtggtttgt tgcacccatc 
aatgctatcc ctccccttgc cccccacccc 

20 

ccctgtgccc atatattctc gttgttcaat 
tggttttctg ttcctgtgtt agtttgctga 
25 ctgaaaagga catgaactca ttctttttta 
gccacatttt ctttatccag tctatcattg 
tattgtgaat agtgctgcga taaacatacg 

30 

ataatccttt cagtatatac ccagtaatgg 
taggtccttg aggaatcacc acactgtctt 
35 ccaacagtgt aaaacaattc ctatttctcc 
ctttttaatg atcactattc taattggcat 
atttctctaa ttaccagtga tgatgagctt 

40 

tcttcttatg agaattgtct gttcatatcc 
gatagaatat cgttcagtca cccaggctgg 
45 acctctgcct gctgggttca agtgattctc 
caggtgcata ccagcacacc tgactaattt 
cgtgttagcc aggctggttt tgatctcctg 

50 

aagtgctggg attacaggcg tgagccaccg 
gggttgtttg tttttttctt gtaagtttgt 
55 cctttgtcag atggatagat tgcaaaaatt 
ctgatgatag tttcttttgc tgtgtagaag 
attttggctc tttttttttt tttttttttt 

60 

ggctggagtg cagtggcgcg atctcggctc 
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actggagaaa cttatgctaa gtaaacttac 13 80 

tttatatgtg gaatctaaaa taacaacaaa 144 0 

gagagtgaaa ctgtggttac caggggtaga 1500 

gagagcagca ggtcaaaaga tacaaagtga 1560 

ctaatgtaca gtacaaagac gatagttaat 162 0 

aatgactaga ttttagctgc tcttgccaca 1680 

ataatagata tgttaatttt taaattttta 1740 

gcagaacgtg caggttgtta cacaggtata 1800 

aacccatcgt ctacattagg tatttctcct 1860 

cagacaggcc ttggtgtgtg atgttcccct 1920 

tcccacttat gagtgagaat atgcagtgtt 1980 

gaatgatggt ttccagcttt atccatgtcc 2040 

tggctgcata gtattccata gtatatatgt 2100 

atgggcattt gggttggttg aagtctttgc 2160 

tgtgtatgtg tctatatagc agaatgattt 2220 

gattgctggg tcaaatggta tttcaagttc 22 80 

ccacaatggt tgaattaatt tacactccca 2340 

acatcctctc cagcacctgt tgtttcctga 2400 

gagatggtat ctattgtggt tttgatttgc 24 60 

tttttcctat gtttgttagc catataaatt 2520 

ttcacccact tttttttttt ttttttttga 2580 

actgctgtgg cacaatctca gctcactgct 2640 

ctgcctcagc ctcctgagta gctgagacta 2700 

gttgtatttt tagtagagat ggggtttcac 2760 

acctcatgat ccatccgtct tggcctccta 2 820 

cgcccggcct ccttcaccca ctttttgatg 2 880 

ttaagttcct tgtagattct ggatattagc 2 940 

ttctcccatt ctgtaggtcg cctgttcact 3000 

ctcttttgtt taattagatc ccatttgtta 3060 

tttttaggcg gagtctcact cttttgccca 3120 

actgcaagct ccgcctcccg ggttcacacc 3180 
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attctcctgc 


ctcagcctcc 


cgagtagctg 


5 


aatttctttt 


gtatttttag 


tagagacggg 


tctgacctcg 


tgatccgccc 


acctcggcct 




accgcgcctg 


gccaattttg 


gcttttgttg 


10 


ttgcctatgc 


ctatgtcctg 


aatggtattg 




ttaggtctta 


tgtttgaatc 


tttaatccat 


15 


aagtggccca 


gttttagttt 


tctgcatatg 




aatatggaac 


actttcccga 


ttgcttgtct 




taaatgtgtg 


gtgttatttc 


tgaggactct 


20 


tggtaccagt 


accatgctgt 


tttggttact 




gtaaccattt 


tattctctat 


atgtatctca 


25 


caatgaaatt 


tattttacaa 


aatgtaagtg 




cttaagacat 


gtcccctaag 


aataatattc 




cccaaaattt 


attaatataa 


taataatcca 


30 


tattccctaa 


ttataaaaat 


atgaggataa 




aacacataaa 


ttatggttca 


ctcatacatt 


35 


tctactcata 


gagcagtttc 


tcaccatttt 




cttttatgac 


actgactttt 


gtttttgttt 




gccccatatc 


ttggacattt 


cttcttgttt 


40 


aaacacaaca 


tagatgatgt 


gacttaccaa 




aataacagtt 


tatttcaatt 


ttggtggtgg 


45 


atcagttgaa 


tgtttatgtc 


ttcccgaaat 




atggtattaa 


aaagtggggc 


ctttgcaagg 




aatgggattg 


gtgctcttat 


aaaagagacc 


50 


gagaggacac 


agcaagaaga 


cagacatcct 




gaggatatgc 


cagcaatttg 


acgtcagatg 


55 


gtttgttttt 


tcagccacct 


agattatggt 




gacaggtggt 


attcacatgc 


tattttcatt 




aaaggaatca 


gtagtcttat 


gcttttaaga 


60 


ttcactgaat 


ggtgttagta 


tccattaaat 



ggatacaggc 


acccaccacc 


acgcccagct 


3240 


gtttcactgt 


gttagccagg 


atggtctcga 


3300 


cccaaagtgc 


tgggattaca 


ggcataagcc 


3360 


caattgcttt 


tggttttatt 


catgaagtct 


3420 


cctaggtttt 


cttctatggt 


ttttatggtt 


3480 


cttgagttaa 


tttttgaata 


agctgtaagg 


3540 


gctagccagt 


tttcccaacg 


tcatttatta 


3600 


ttgtcaggtt 


tgtcaaagat 


cagatggctg 


3660 


gttctgttcc 


attggtctat 


atatcttttt 


3720 


gtagccttgt 


aatatagttt 


gaagtcagta 


3780 


taatatcatg 


ttgtatactt 


taaatatgta 


3840 


cttgatatcc 


tgatacttaa 


tgtctaaata 


3900 


tactatatag 


ctttaataca 


attattacat 


3960 


atatatagtt 


cttattacaa 


tttcctccat 


4020 


tggaggggct 


tatgtgttta 


tatataaata 


4080 


tggttctctt 


gtcctttgag 


tctttcataa 


4140 


ttgtttgttt 


gattttgcat 


ttattttagt 


4200 


ttttagtctg 


gagcatttgt 


ttcatataat 


4260 


tccatgagtt 


caagttaata 


attttaataa 


4320 


cttttcattg 


catcacataa 


ggatgaacat 


4380 


ttactttaat 


caattgattc 


gggtggtgtt 


4440 


tcatatgtta 


aaattctaac 


ccctaaagtg 


4500 


cgataaggta 


acaaaggtgg 


agtcctcatg 


4560 


ccacagagct 


ctcttgctcc 


tttcgaccat 


4620 


tgaaccagaa 


agttggcctt 


catcacactt 


4680 


tcctggcctc 


caaaatcgtg 


agaaataaat 


4740 


atttttgtta 


taacagtctg 


aacagacaaa 


4800 


ttaaaggtat 


atttttaact 


tcataactaa 


4860 


caaagtgaat 


attatgtttc 


aaaaaatatt 


4920 


ttgtcatctc 


atcctctatt 


ctcagcctaa 


4980 
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atgtattagt ctactttatt actcaagtta 
gtataaaagt tttgcatttc ctgggaacac 
5 tcctaattat gtgtcaatgc tcattattca 
ctctttggac ccaaaacaac tatctgagtt 
tcccatctgc atctcagctt tcatttccca 

10 

aatcttctat tccctaagga catctgtcct 
tggcatggtt taagtcaatg tttccccatg 

15 gattcagaga accatcaacc ttgtcaaaac 
aatcataatt tggacagaga tatttcaaga 
taccaaataa tatatgtaag acacaaagag 
taacaaataa gattttatct tgttaaaaca 
agatgactct ttctagctgt tgattcctcc 

25 gcttaattgt cactttgagg aaaatgtttc 
tccttcatta taggaagaaa cttctcactt 
aatagcttat gctgaacatt agcccttcct 

30 

accaggctgt gttttcaact gaggacttcc 
attacaagaa agattaaaag tcttgcaaac 
35 tttaagactt tcttagtact tttcttaaga 
gagttagagg aatatagtaa tggaggaagg 
ttttgttttc cagcaatcca aagaacaata 

40 

agaaagatat atcaaaagag aagagttcct 
tggaggggga aacacgataa ataagagaag 
45 cctggtcctg aaggagaggg ttggagagtg 
tgggaaacag ggctagctga gaaacaaaaa 
aaggccttct tattctctaa ggagggatcc 

50 

tgcccatgga cctgcaggaa caagagatat 
atattgttag agaaacacca aaatttatct 
55 tgtttctgag ctctgccacg ggaacttggg 
agacgcatca aaatgagtag aaaaagtcgt 
ctgatacagt gagactcgat ctggagaaaa 

60 

gcaacaggag tgggccatga gtccagcatt 
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caactgaaag 


acggaaactc 


ggatatcaag 


. 5040 


ataattgtag 


taaactctta 


gataaattga 


5100 


tgaggaagtt 


agagaataag 


aaggccttaa 


5160 


tttgtttctc 


agctagtcct 


gtttcccagg 


5220 


ctctccaact 


ctctccttca 


cgaccagggg 


5280 


tctcttattt 


gatcttatgt 


cagtttgaca 


5340 


tagctgaggt 


gctaagtgca 


aatattaact 


5400 


cataaaactg 


ctctttatta 


gttgacccaa 


5460 


cattttcaga 


tataaaccat 


aaaacaaata 


5520 


aaagcaattg 


atccgaggct 


actatgtata 


5580 


cccgcatgcc 


cgcataaatc 


agtcagaacc 


5640 


tcctcacaaa 


gagcccacac 


gcttcatttt 


5700 


ttcctttctt 


tccttcacac 


ttccctcttt 


5760 


acatatgaga 


aatcacagtc 


acttttgtga 


5820 


tctctggatc 


ccttgaatgc 


tggaatagtt 


5880 


tgttgcccaa 


tactaatgta 


tccaatgcag 


5940 


acatgaattt 


ttttctttct 


atcccaagca 


6000 


taatatatgc 


aaagggagag 


ataagtggag 


6060 


aaaaggagat 


tttgcttact 


agtttctagc 


6120 


tctcactaaa 


aagtttatga 


tacatgcttt 


6180 


agagtacagg 


gtatgagagg 


tatcatggaa 


6240 


gacacatgtc 


cttagggaat 


agaagattcc 


6300 


ggagatgaaa 


gctgagatgc 


agatgggacc 


6360 


ctcggatagt 


tgttttgggt 


ccaaagagtt 


6420 


ctagaactag 


aaatgccagg 


gagacagttt* 


6480 


tttatggaga 


atgtgcacag 


agagttgtat 


6540 


ccaactcctt 


tgttgtcaaa 


cataaatatc 


6600 


cty catLLL cty 


y *-y ciy ty i— cty 


ciy ccdyy ciad 


DOOU 


ggcattttac 


tcgtcctggc 


ctctttcacc 


6720 


ccctgaagtc 


ccagcttctc 


ccttagggaa 


6780 


ctgggtttcc 


agcggggttg 


cccaaggcac 


6840 
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tggtttctgt cttacatgac tcagagtgca 
gaacaaaggc aagcaccctg tttcagcact 

5 

agggggagaa agaggttaca ggctccagac 
attacacaca caagcccata aaaagcttga 
10 caataaaggt cttccaatgt atgaagccac 
ttttcaaata ttaaaagcac tgcaggataa 
aaacatatct aagatgcatg ctaatagaca 

15 

aaaaattcat caaatatatg gcaatacagt 
tgctgcatat tgcaaatttt gtgtgaccct 
20 tcatgctgct gccaccctgt gatgtgttta 
ggaaaaatcc tcatgatacc aaggttgtgt 
ttctctaaca cacaggccaa gtctggacta 

25 

gccagaactt gggagctctg tacttttaag 
caacaccatt atgacatcac aacctatttc 
30 tgtcatcaac ttctcccaag cagagaaacc 
gaagaaacgt ctacaggcaa aagtcaaagt 
ggagagqggt taggggaagt gggaagagat 

35 

gagtgtggag acccttaagt tttgctggag 
actgtcccag aacttttcca caggagattg 
40 tatgaggagt tgattttctg ttgttgtctt 
ttttctttta cgaaagtaat ggtggtaaat 
tttcacaact gcatgcttaa caatccacac 

45 

gcaaattaca gaaaaaatat acctcctctc 
aaatgatatt taccttttat ttgtctgttt 
50 tgtattgcat gaagggagga gagaaagaag 
gagttttggt ttctagaatt gcctctgcac 
gactggattt caggacctgg tgggactggt 

55 

tctatgaggc tcaggtgaca aaaggtctag 
tgaatctctg aggtggatgt caggaaggaa 
60 acatgagaac atgcagtatt tggttttgtt 



gagtttggat gccatgttgg gggcctctga 6900 

agggagattg tagtactgaa ggcggacacc 6960 

aagaaacaag caaacctctt taactgggaa 7020 

agggccccta gaatctctag ctatgctaac 7080 

tctgtaatga ctggaggagg ttgggtgtat 7140 

aaactaagag catgtctagt aaaatatgat 7200 

aaaaataaat gattagagtg catttgtcta 7260 

tgttgtcagt aggactgggc aaagggacat 7320 

ttgcctctgg atattgggac ctctcaagat 7380 

ttatctataa tggactgtgt caaacattat 7440 

ggctttcact gctccataat atcagtttct 7500 

attggcagaa actgacttac cagttttgca 7560 

agctaaatct attttttctt ccgtagttgg 7620 

caatgagacc atcataatgc tcccatcaaa 7680 

cgaacccacc aaccaggggc aggatagcct 774 0 

tattggggta aatctaattc agaacatgtt 7800 

gatgtatgtt ttgggactgg tcatccttgt 7860 

ggactttggg gtagaagcca cttggagaaa 7920 

tcttagaaac gttttttccc ttattctgaa 7980 

tgtatttttg caatacagta cttctctata 8040 

aatgacatat tgtatataca tttttctgta 8100 

tgattctttg caaggaattt ttcccttact 8160 

cctgttttct tcttaccttt ttgagaatgg 8220 

ctcctctcat ccagagttta ttaaagaagc 82 80 

ggatctatct tggattggga gggaaacaca 834 0 

ttatgtgatc tgaggtgagg cactctgttt 8400 

ttccttgcct catgtacaat catatgggtt 8460 

gaaacttctt ggtaaatgat aatagagctg 8520 

aactacctcc taagacagag cttcagttat 8580 

agtttgctaa gggtggtggc ctctaaatga 8640 
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tgagaacaca tggacacgta gaagggaaca 
ggtgggagga gggagaggat caggaaaaat 
5 tctggctaca aaaatctccc aaagccagca 
ccagttcttt ctttccctcg ttggtattta 
tccaacacta gatctgtgct gtgcaatgta 

10 

tctatgatga tggaaatcca caacagctgc 
atcgatgatg tgaaatatgg atagtgcaac 
15 aaatataagt agcccatgtg gctactggca 
tgttttcaaa cttttagtag gaaatatatg 
tgaacatgca ggaagctcaa ggatgcagga 

20 

gagacatcga ggctttagtg agttctactc 
attgaagaag agtgccatct caagacaaat 
25 gcatcaagac tactactggc tactaggagg 
gggaaattgg ggtcccagag catcaactgg 
ggattaaata acactctact gtatggctgt 

30 

acaatagctt tcagaatgct gtgagctcag 
aagtcacttc tgaggtcagt ttatttattt 
35 cagagtggga aataaaacca aggaactgag 
tattgtgacc tcaattttct gacatccagt 
tcccacacat tacaccatcc ttgggcccgc 

40 

ctgcgctcaa tgtggccagt catatgcctg 
aaccaccccc tcactcatct ttgagaggtg 
45 agtgagctat taggttggtg caaaggtaat 
acccaaaatt acttttgcac catgctaata 
ttgcacgagg ggtttctgag ttggccgaga 

50 

tgcttgtggt cacactcaca gtaattttcc 
ccagggaaag aatttaggag agtaacattc 
55 ggaaccagtt catatgttca ttcattcagc 
ggctggacac tggagaggcc tacaacaaga 
cagataaggg aataacttgc aatgtaatgg 

60 

ttgggaatga ttcttaccca gcatgtctct 
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atgcacactg 


gggcctttca 


gagggtggag 


8700 


aactaatgca 


cagaaactca 


atttggaagc 


8760 


aggaagtcag 


agagagtgcc 


cagaaaaatg 


8820 


gagattattg 


tttggaaccc 


tcagttacaa 


8880 


actgtctatg 


atggaaatga 


aatgtaactt 


8940 


actgttacaa 


tatggcggtc 


acatgtggct 


9000 


tgaattttaa 


attttactta 


attttacttc 


9060 


accatactga 


gtagcctagt 


acaagagtaa 


9120 


ggcctcttac 


tccagagaac 


aatcccacag 


9180 


tgtggtctcg 


gggtgggctc 


agactcacca 


9240 


ttgtcacaca 


ttgtaccagc 


cctttttgag 


9300 


ctagcttatt 


ctaagaaata 


tagactctag 


9360 


cccaggagtg 


tacacttcag 


tcattatgtt 


9420 


tgagttccta 


aaagaagaag 


tgcccatatg 


9480 


gggatgtgcc 


ctggagccta 


cctacctgta 


9540 


ttctcttttc 


tgtgtgagtt 


ggagtaaagg 


9600 


cctccacttc 


ataaatgtag 


aaatgagaac 


9660 


tacccaaggg 


aacaagctgg 


gtcaagtcta 


9720 


ccaggccttt 


attctgtacc 


tcccgctgcc 


9780 


acctagacct 


ctgcccacag 


ccaccccatg 


9840 


cccaggttcc 


agggcctgcc 


tgagactcag 


9900 


tgcttttgag 


agaggcaata 


ggcacgcaaa 


9960 


tgcggttttg 


ccattgaaag 


tagtggcaaa 


10020 


caccatccag 


attttctgtg 


tggttgatgc 


10080 


gagaaactgc 


atctggtaca 


ttgccacctt 


10140 


aggcaagcct 


cgtgggccct 


gctatctgtc 


10200 


tctaccacac 


tactgaggtc 


ctgtttgagg 


10260 










aatgatccct 


ccgggacttc 


ctgagggaga 


10380 


caaaaaggga 


taagagagat 


tcgtggccct 


10440 


ttcaggtgca 


tagcagcctg 


gctggaagca 


10500 



-199- 



WO 02/062946 



PCT/US01/48437 



ttctgagtgc tctgtctgcc ctggtgggtt 
taaatcctgc ctcattgcag tgtaagttgg 

5 

tttcttatga ttatcattca ccttacacca 
ctgtaagtat tttttagatg ggatgttcta 
10 cttattattc catcctttga aaatctctct 
gccaaggtta tgtaaatcaa aggggggctg 
acagagtgga attgaaatgt cagataaggt 

15 

tagagtttct taattactca actaatctaa 
actgagaatg ttggtacata gagtggtgaa 
20 gggtgatcag aaaatgctga caaggaaatg 
ggtctaaaaa gacatcagac taaagatgaa 
tggtaagggt ttctcctaat gattacaata 

25 

cattattatt gctttttgtt atagctgaaa 
aaatataggg agatggagag tgtaactgag 
30 aagaagaaaa acaatgaagt ttggctaaca 
ggggtatgtg catggccagc aacgcattga 
gttgcagaat atttattgtc ctgcaatcat 

35 

acgatggaca gatagaaagg tctggtcaaa 
gagccttgct tccaaggtaa tggtataatg 
40 gtaattggta caaattccac cttgggaaaa 
tagtgataaa gtataaattc atgatgggac 
ttgagcttgc ttgctggaac aatttctagg 

45 

ctgtgtgtaa ataccacaaa acaaaagttc 
gatgaccagt tctgcacaaa gaatgtctca 
50 gttggtgttg gtgattagca acagtttact 
caccaaaaac agcaaaaata aaaaataaac 
agatacatag acagggatgt caatactgaa 

55 

ccttgatcct cacaaatctc tgagaaataa 
aacaggagct agagaaagaa agagacttgt 
60 atgagactat aatggagctg ctgattctca 



tcattctcct gtctgtcaac ccggctgcat 10560 

acgaaaagga tataccaacc agacttcttc 10620 

tggactgcca tagagccaaa gccagtctgg 10680 

attttatgag gtttctgaaa gccttgattt 10740 

gttaattctg gtttgggcat ctggaggaag 10800 

catggatgag attaagggga ttacatacta 10860 

tgatgatgaa aaactaaaac ttacggtttg 10920 

ttttggaggc ataaattatg agtttgtcaa 10980 

ttcagattgg caaacagagc agaaagaccc 11040 

tggtatgtgt gtgtggtagg gggtcagttt 11100 

gaaacagaga ataggaaata tggggttaga 11160 

atgaataata ctaatcaggt gaaacggtaa 1122 0 

gaagtgtgat atcttaaatc aggagataag 11280 

aattatcgca ggagaattct ctctctttgg 11340 

gaaataatgt gtctgaagtt gaaaaattaa 11400 

gaggaaaaca ggcaacataa aagtaaagaa 11460 

aaaaatgagg tcagccaaga actcacctgg 11520 

ttcatgcttg tacagcaccc attataaact 11580 

aggaaaaagt gcatgcaaat gaggctgaca 11640 

attctggaga gaaatttggg agaaatttgg 11700 

gattttatat tggaaataag agagattctg 11760 

taatggacac aacatgtatc ttctatttaa 11820 

atgtgatcaa attacctccc tttattatca 11880 

ggacagaaat ggtagaactc atccttgtga 1194 0 

accttcactg tttttttcca agtcttaaaa 12000 

aacaaaaata aaaaacatcc aaacaggttc 12060 

ttttacaaag ggcttacaca gccaaaacct 12120 

atgttagtat gcatgtttga tacatgagga 12180 

ccaggcttac acagctactc agcagtaaag 12240 

gtcctgggag ctattacgcc actctgctct 12300 
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cttactgagt gaacaatgtc atcagaacta 
tatgaaatgg attagttgat ttagttacta 
5 aatgagtgaa ggagttgaat taagacaatt 
gtttccttac ttcattctct aatggttgag 
ctaggcatga aagcttcggc tgacccaagc 

10 

ggaactctgt ctctgatgct ggtttctact 
gctgtgctgc agtggaaaca gactgtctga 
15 gcttaatctt gcctagtgta tcttatctct 
tagaaggaac aatcagtcag tcatgagata 
agaccgagaa gaaaagcaga tggtaattga 

20 

tgaaaatagt taaaatgaga aatcaactgg 
atgagactct gggaagttaa atgactgggc 
25 tgaggacgct agaacctggc ttgcctccct 
ctagtctctt taatagtatt gacttggtaa 
gaatgtctta tgctctgctt ttcctcctgg 

30 

atatattaaa aatgtattgt tgaatcaaat 
gctttattta actgaattta aaagccttga 
35 agaatactag actgataaat tgtaattatt 
ttggattagc taagaattct atgtatacag 
tgaacctgag taaagtctga agacctaaca 

40 

tgcctcacag ttactcatgt cctcaaagac 
gacttcttgg gaaaaaacgg agaaatatta 
45 aaaacctaac cattataaaa aagcaaactt 
atgcatattt gttttaaaaa attatttcgc 
ctcttggttt gcctgatact gacttcagca 

50 

agactagcca aatagtctta gacatttccc 
atcacagtga ggcaaaggat acagcttttt 
55 actgatttgc tgtggtgcgc agactctttt 
ttttactcat tagtgcataa aggcaaaccc 
gccggcttca aaataaaatc tttgagtgca 

60 

tttcctttaa acacagaata aatcaacaat 
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cgtacgtctc aaagaagtga gaaacaagtg 123 60 

ctcatcttcc cttctcttct caagaaactg 12420 

gtgaggagga gcagttttgt tttcctgatt 12480 

gtggtcttca tctctatggt cacctctaaa 12540 

atcactgata ttttatttct tgactttcag 12600 

gtgttggagt tctgcctagc tgtgctcact 12660 

cttccctggg tgagtgtgct ggccagcctc 12720 

gcactgtgtt gagtatgtca ccaagagtgg 12780 

cacatgggag gtcatttaaa tggtgatgga 12 84 0 

gtagctgaca agctgagagt tcactggata 12 900 

ttcaatcttc ccattttgtc attgaaggga 12960 

tggcattatg atatgagtct gtacctgtgc 13020 

tctaagcaga aacaatttct gccacaacca 13080 

agggccttta cacatgtgac tggatccagt 13140 

tgattttaaa attctttttc cttttaaaag 13200 

tccaagggat accaccccta tggacttatt 13260 

tttatccaga attttattat agagaataga 13320 

tgaagaaccc agtgatctgc tctacttgtc 13380 

ttggaaaaaa tggcatatcc acatgtatct 13440 

tactttgttt cttctaaagt gtgcttttcc 13500 

gactcatgat gctggatatg aagaactatt 13560 

attggaaagt tgatttttat gataatatgg 13620 

gagtttccta aatgtaagca tttaaagtaa 13680 

tgtcatttat gatatgtgtt cattggggat 13740 

aaagcacagg gctgtaaatt accatttact 13 800 

agaaaactag gcagaatgat gattcaatgg 13860 

cttatactcc tcaattgtga acaggagtta 13920 

atcacaggtg ctactgattt atcccatgta 13980 

cataatgaag tctccgagtg tatgaaagta 14040 

tactttattt taatacttat tttgttattt 14100 

gacagacata aacccaaatt ttaactcaat 14160 
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tttctcctca ccaatattca aacaaaacta 
tatctgttat gaagagacaa ccagttcata 

5 

gaattgtttg agtataacaa acataagcct 
acacacacac gtgtaaaaaa gtatttcaat 
10 ttgtcatatt tccttgctct gaccctctgt 
cagatatcct gatactcaga gagtttctct 
tcagtttctt tgattttagc tttatcttca 

15 

tttgcacccc ccactgacac acaccatttc 
acttttattg atacttattg gataccagac 
20 tgaataagat acatgatttg gcttcaagat 
acaatgaagt tagggatttc gctaactgtt 
tatttgagca cagaataaag aatatcatct 

25 

gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 
gggggagaga aagacagaaa agacagaatg 
30 ttaatcacat ggtgccacct tccgccacag 
tggggtaata tcagaggttc attgtctcat 
agagtgaatt taagagcaga agtttaatgg 

35 

gaggaaggga tcccgaatgg atttctgggt 
gagtctgagg aggtcgtgtc tgatttacat 
40 gccatttaca tagcatggga agtgctggcc 
gggtctccac ctggccggca ccatattgcc 
aaaagggaag agggaacctc catgcttaat 

45 

tggcacaact gctggcattc acctttgcaa 
tggatttttc aggcttcttt tgctagacaa 
50 aagaaaatct tactgagaac tcccttaccc 
ctcctgtgtt atttctcccc tcagaagtag 
cagtgactct ttctgactac ttcctgctga 

55 

ggctcctcct gagttcaatc taacggtcct 
gcctgcagca ccatttaaag tttgcttcta 
60 ttagaataca gggtttgaat atgagtatta 



acgagtgaac 


aaatgctaac 


caggaatgct 


14220 


cttacgtcct 


tatttcaaaa 


taaaaaagac 


14280 


caccttgctc 


atgcacacac 


acgcacacac 


14340 


tttatatgac 


ctccatttct 


ttgaaagatt 


14400 


agcaggggtc 


atacttcata 


ctcactgtct 


14460 


gctctttttt 


tccagtctca 


aatggaactg 


14520 


aatctgtttc 


attaggtctt 


ctccctgtcc 


14580 


ttttatttat 


ttgcctattg 


tggacattca 


14640 


actatgcttt 


taatgccagg 


gctaccaaag 


14700 


atttatagtc 


aaataaggaa 


aacagagatg 


14760 


atttgagcac 


agaataaaga 


cactaactgt 


14820 


tctgcctaaa 


agaatgtaga 


ggataaacaa 


14880 


gtgtgagaga gagagagaga gagagagggg 


14940 


tacttccttt 


gaaaatccac 


ctccccacta 


15000 


gagaggacca 


ttgaagcatt 


gtcatttgtc 


15060 


gccaagaaaa 


ttgaagacat 


ggacacacaa 


15120 


ctaaaagaaa gaagagtgct 


tcctcatgca 


15180 


ttggggcaag atgaggctgg 


ttttatagat 


15240 


caggcatagt 


agattggttg 


gaccagatgt 


15300 


atcccacact 


aatcttttat 


tatgcagatg 


15360 


tgcttttttg 


ctgcacacat 


ggtgacaaag 


15420 


atatctagct 


tccaggtatc 


ccttttctat 


15480 


gcttccagct 


tgcttatcta 


ttcttgcagc 


15540 


gaaatgattt 


gggggctgct 


ttttaataaa 


15600 


tcactagcta 


cttaaataat 


ttctttttag 


15660 


taaccctaac 


tgctgttaga gggtgttgga 


15720 


aaaggggcat 


ctagtgggga 


cagcagctag 


15780 


cagaagaaag gcgtgtccgt gtgtggttct 


15840 


ggccaaaaga gataaatttt acaagaaggt 


15900 


agattaccat 


tattagtggg ggtaccatag 


15960 
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gccataacca tgccagtgga gtttgatacc 
gtttgcctcc accaggtgtc gcttacttta 
5 ttttgcataa gtgtccccca attattaatt 
ttacagcaat tagaattttc caaccggaat 
gtgttgcaaa tagtagagcg catgtatcaa 

10 

acgtctaaaa cttttactcg ccaagatgta 
aaatgtaatc ccatgtataa ttaaaatctc 
15 atatctggct gcgaatctcg agaggaaatg 
aaggtgggac tgagtaagat gggtagccct 
gcttaagatc tcctattttt tgtcattgat 

20 

ttgctctctc agctttccag gctttgactg 
taacttttac tgctgagggg gttgaaagaa 
25 ggaagagaga gagaagggag agctttcacc 
gatcctattt cttggagttg ggcttttgct 
agagaggtta catgctcaac caactcagag 

30 

aggaaagacc atccatacag cattttaaaa 
gaggtattta agtagaccac agcggggagt 
35 gatgcctttt ctgtatggca tgaaaatgct 
accaggagat attggatgcc cttcatcttt 
tcccctggat agcttcctat tctttgggtt 

40 

ttatttttaa gacagatttc acaagcatta 
tctcctgaaa acaatctctg ggcacattga 
45 tgatggagga ttttaaggac tttccattgg 
tgactgtagc catcctgaga gctggaaaat 
tgcagggaag tactgaggtt taatttcttt 

50 

agtccataga tgtcttgagg cttccttgct 
tttccttcag ctacttcatc ttttcccttt 
55 tttcatggaa ggaaaactga ggatgataac 
tgttttgttt tttttgtttg tttttttgtt 
tcactctgtg gcccaggctg gttgcagtgg 

60 

cctgggttca cgccattctc ctgcctcagc 



cgu uagcca u 


u u Cact tyyc u 


ty tdaLdLta 




ccagacgcgn 


CadLatoaoa 


yyaawttt U U 


iouoy 


cxacuauaaa 


ga u ua uaau u 


ana a a f~ 

dy ctucty act uo 


1 (Ji Art 


uccauau cgc 


ay y uy uu duo 


aty UaLaytL 


i fi? nn 


t tcccggaaa 


ggtggrgnag 


Ua.ya.Uaa.UUU 


lOiOU 


gaatttctcc 


tugggguuct: 


augaaguuac 


XOJ£ U 


cctgcaaata 


uguguuaaaa 


agaagugcua 


lb JoU 


uugaaaugac 


aaaaagcauc 


uggugaguua 




cacttactca 


CuuauCuuuu 


at gat tttca 


lob U U 


gtccaggacg 


uucuuctggg 


ctgtcagagg 


1 £ C C t\ 

1656 U 


gagtgtgatg 


tatgcaggaa 


ttaatacctg 


i c c o n 
lfabz U 


cagtgtaggg 


cccttcccag 


c ^tggggtca 


1 croft 


aataccaaat 


ctcctgggtt 


aaacaaaggu 




aactgtgtta 


attcctgctg 


gaagtgagcc 


16800 


atttcttaat 


ttaatagaaa 


atcattggta 


1686 0 


gggctcactt 


tgtctgccat 


ugucacuuug 


lb yz 0 


uatuucatua 


actaacactt 


tattacctca 


n £ Q Q n 
16 So 0 


tctaaccagt 


tggcgaaagt 


atctgtccct 


1/04 0 


ggcatgtggg 


ugaaguc ugu 


cugccagucc 


1 / 1U u 


^9 a 99 a 99 aa 


ggagccacc u 


guucagggga 


1 *n cn 


acaacctgtt 


tggctgtttt 


tggtaagt tc 


1 l ZZ\J 


cgagut.uL.au 






i Toon 


ctagagactg 


gcaaguagag 


uutgccaccc 


i7^n 


auacacucga 


gaaguggccc 


auucuauuuc 


1 / ^ ULf 


auggaaccu u 


CCCayat toy 


a 9 999 c c 9 a 




gctgacttag 


ccgcccga cc 


agcuaaccca 


1 / dZU 


cga ug ncccc 


cacagugLay 


CdCtyLCatC 


neon 


ctgctaattt 


cctggtggtt 


tttgtttgtt 


17640 


ttttgttttt 


tgtttttttg 


agacagagtc 


17700 


cacaatctcg 


gctcactgca 


agctccacct 


17760 


ctcccgagta 


gctgggacta 


caggtgtgcg 


17820 
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ccaccacacc aggctaattt tttgtatttt 
aggatgttct cgatctcctg atcttgtgat 

5 

attacaggct tgagccaccg cgccctgccc 
agtaagaaag tgtctttcct tccaaatggc 
10 cttggagtgt aaatattacc tacctttatt 
atcagttcag ctaattgact gcttgtggct 
tttagagtgg ctactgcata tcctgactgt 

15 

gtctagtctg gattttctag gagagtttcc 
attataactt gttcacagtc atattcaggt 
20 gtgtttaggt gagaacaagt gtttaactca 
taaggagctg gctggctgtt agccaaaggc 
tatgtggggt gtaaatcatt tcccagggtt 

25 

actgtggcaa tggcttggag gcatgtataa 
gttgagaaaa atattagatt agaatttttc 
30 aagaggggag gcctggatta gacaggagaa 
taccttcctt cctttaattt ccagaatcac 
gagccactgg agctggaggt ttgagcccca 

35 

agactggttc tgaaactggt gacttctgtc 
tgctacagtc tggacagggt caaggtgact 
40 ccctgacttg ccacatcaat agcatctagc 
agcctgcaaa acagctactg gaggctctgt 
ggcatcctcc tgtccctatt taaaagacca 

45 

tgctatctgg tcctatcacc tgctgctgta 
taataaactt gttctttagg atgagccatc 
50 ttttattagt gtatctctca gcctttccat 
gtctatcatg gacagtttag aataattaaa 
taatatacac attaaaagtg tttcctttac 

55 

tcagggttgg caaaaagaac tgcttccctt 
tcaccttctc tgtctccttt tctccttttc 
60 ccaaatttct ctgctagctg cagagctgcc 
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tagtagagat 


ggggtttcac 


cgtgttagcc 


17880 


ccacccgcct 


cggcctccca 


aagtgctggg 


17940 


ctggtggtat 


tttataggag 


atccattagc 


18000 


accatgagca 


tggagaacta 


agaaagtata 


18060 


ttgcttaatt 


caagtgctct 


tgtaagagct 


18120 


ggagagagag 


acacactttg 


aaagacgtca 


18180 


ggagcttcac 


agctactatc 


tgtgaagagg 


18240 


ctgagatact 


ctctggctgc 


attggtttgt 


18300 


tccccagctt 


cctctggaag 


gaaagtggct 


18360 


attgtggaaa 


tctctaacaa 


cagagtctgt 


18420 


tccccctaga 


ggacagcaat 


cctgccacat 


18480 


aatttggtgg 


cttctgggac 


cagtagggcc 


18540 


ataataggtc 


cccccaactg 


caactaaggg 


18600 


ctgagatgcc 


cctcacgttc 


atgctatggg 


18660 


gagactggct 


ctgttgtcca 


gaaggaggtc 


18720 


ctggggcttc 


tgtgctgttc 


atgacagttt 


18780 


ggaaccatca 


gtcctattgg 


accatctgtg 


18840 


tctgggagca 


gtctgatctc 


cagtgatcca 


18900 


tcttgctgcc 


tggcactcct 


tcttaaagtg 


18960 


ggatgcacct 


gaggaatcct 


gggctttgca 


19020 


ccttctcttg 


tgtttcctct 


ctttctcttg 


19080 


aagtggccat 


gctcaggagg 


ttctccaagg 


19140 


gtttcctctt 


aatgttggga 


actgcctgtg 


19200 


ccttgactga 


ataaggggat 


aggaagtgtg 


19260 


aaaggctgca 


ggattctcat 


ctggcttttg 


19320 


aggtttggcc 


ctagttcttt 


gtaggccctc 


19380 


catttatctg 


cagaactatt 


ggagttccaa 


19440 


cctatttgga 


atggtgtttc 


cattattcct 


19500 


agtctactat 


aaaaaacatg 


ttgttcatct 


19560 


tgcttttcag 


ctgcggtgag 


ggtttggctt 


19620 
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agaagcagca taacatccct ccatgcgagg 
tctatatacc tattggggtc atcagaaaat 
5 cctgaaatga gcaggaaatg tgaatcctaa 
ggggtaagag tgaagcaggg aaagtgggta 
gtggcatggt tagtgggggt cccagataag 

10 

ctgtctcaga ttgtccccct ggaatatgct 
ttcctgacat ggctgctaaa agagctaggt 
15 tcttgcaggg caaagaaagc ctgtacatag 
tagaaaagat ctaattgttg gataatatta 
ctgttcatcc ccaagatggt aagaagttgt 

20 

aagtctcagg gtcagaggag tcccaatgtt 
aaagacaatc tgttatgcat ctagaaagag 
25 ttagtctcat tccttttggt atgacacagg 
ctactgtttt ctctccttgc ttcctgtggt 
gttgcaggca taacccccca ccccagccat 

30 

tcatgcatac ccatgcagcc ctagttctcc 
tgtatgatct gcatggctcc ccaaaagatg 
35 tttgggcaag actgctttaa tgggaggggt 
atggcccatg ctaaagtgtt tacccttaga 
atccccttac catttaagta ccattctaac 

40 

caggaaccaa atggtggttt tcctgctaac 
atttttctcc taatggttaa aggtagagtt 
45 tggtctcttg tggaggggcg cacaaaggga 
aaggactgac aatgtgccca gtgagaggat 
agaaatacca cgtgcttgcc agatgaaggg 

50 

cttctgctcc tagaaaatca caaaaatggc 
ggcatttgct aatctttcct aacaggacta 
55 ttgcatacag aggataagag atatggcagt 
gaaaattgga tatcctgttg ctgatgccgg 
gaaggcttca gataacacca gggtgtaacc 

60 

ctcttccagc cccatgcaac agctaggtac 
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tcaaacacct gagttaaagt tcggaaagct 19680 

tggcctagtc tccctttatt tgccaaaggt 19740 

tagcacaatt ttcatcaggc atttcttgta 19800 

gttttagaga tagtgcagct ggaggagctg 19860 

ggagactgga agggttgaga cactcagtag 19920 

ttagttttgg ggaatcattt cctatggaat 19980 

tgattgtgca atgcttgcaa gtctgggtta 20040 

gggatcttgg atcatttgcc ctcccatctg 20100 

aaatcaagtc tccaccgcag tgggccaagt 2 0160 

acaatagaat atgagatgct ttttttttca 20220 

ctagaatgca ctcaagagga aattcaggcc 20280 

aactgagaaa aaaaaaaaaa aggcctatct 2034 0 

gtggaggcga agacagtgag ggcatccacc 2 0400 

cccagcacct tgttaaatgt gctgcccatg 20460 

gaaactggag gaactaagga atgggcttag 20520 

gcttctgatt cccctttgac tccctagact 20580 

gatcttggga aaggctatgt aatagttaca 2 064 0 

gtactagatt gaattctata ttttgctatt 20700 

cagcgattcg ggttaacttc tggacataaa 20760 

taaaggcaga agaggtttct ttaaaaaacc 20820 

aggacagtct cagaactaaa attggaggac 20880 

tttttccatt cacagaagga gcataaagcc 20940 

ggaaattgga aaagctagag tgtttcagca 21000 

tcttattcca ctgggtgatg ctgttgacct 21060 

tagaaagagt acagctcatt tcagggagac 21120 

attcccttga gctatattcc cagtttatat 21180 

tttccccaca ccataaaaat ctgtgcagca 21240 

cgcagatgga aaaggaggaa atttgcaata 21300 

atcaagcaat cagaggctgg ggtcagtcca 21360 

ctggcgagaa atcctcattt gctccaaaac 21420 

tccatgaaga gaaactgatt caaaatatgg 214 80 
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ccaacatgcc 


cagtaacctg 


ggggattggc 


catggtatct 


ccaggaagcc 


tgtcatctga 


21540 


5 


gacttgagaa 


cagcaggtgg 


tttgccctaa 


aggcatgtag 


atggccatgg 


gatgcccggg 


21600 


atttttattt 


gatttggttt 


taaaatggtg 


gctaagagcc 


tcaaaatgaa 


tggacagaat 


21660 




tgaggtccac 


ttctatactc 


aggagaagaa 


tgggagtgaa 


gaacaattag 


acaaagtcta 


21720 


10 


aaagatttct 


ttcaattcac 


ccagagtgaa 


gagaatgggg 


tgggggtgga 


taaaaaagtg 


21780 




caataaagtt 


aggatagcac 


aaaaagcact 


ttttgaaatt 


aaagcacttt 


aatattgtct 


21840 


15 


tgagaatcaa 


cttatgataa 


gcttttaata 


ttctcaatta 


tccttacaaa 


cacctttcaa 


21900 


cataagtaat 


catctctaca 


tttagcagat 


gaaaaagagg 


ctcagaggga 


tgacttactg 


21960 




tatccactag 


actctccagt 


aggcagagtc 


gggtattgag 


tggtttgagc 


ccttacagac 


22020 


20 


actaggtctg 


cttagctccc 


gtattccata 


ccaagagggt 


ttgggcaaca 


tctggatcac 


22080 




ggatccaaag 


tacaccttct 


gcctcctagt 


atgtgctaca 


cctaggccac 


gcacctccaa 


22140 


25 


tcccttggct 


tttcttctta 


aaggaaagcc 


ctgggtgatg 


gctgataggc 


agaatattcc 


22200 


atgatcttcc 


taagggtcca 


tgagggatag 


caacctggag 


gtcattgtta 


ctctccaagg 


22260 




ctttcagaag 


ctgactcagt 


ttattacttc 


tcaggctgcc 


aggcaaacaa 


acaaaaaaca 


22320 


30 


ctctttctct 


tctaatgaat 


gaaaagctag 


aagatgttca 


cttgggccaa 


ggaaaataat 


22380 




cagggcgtgg 


taggcagcta 


tagtcctgat 


tctgtggttt 


tccatattta 


ggttcaacct 


22440 


35 


cgggtcattg 


gctgagatca 


gctggacctg 


ggtgtcatct 


gatcactttt 


gttacaggac 


22500 


cagtcctgag 


cctgtgcact 


gaggcctctt 


cagttctttc 


tctgaaccag 


taatacaaga 


22560 




ttccaaatgc 


caggtacagg 


agaaacccag 


tgttggagtt 


ttcttgtccc 


aggtttttaa 


22620 


40 


aagacacgta 


aggacaaagg 


atgtcaagat 


gtagaacttt 


gatgttcaac 


tttctccttc 


22680 




agtcttcctt 


acttgacctt 


ctctccagcc 


ctgaaggaaa 


tataccattt 


ccatgccatg 


22740 


45 


aaattgcccc 


atggacataa 


ttgtggtgca 


ttctgactag 


acacaatcaa 


aacacagcag 


22800 


gacttgcgtt 


tcactaccag 


tccctaaaag 


accacataac 


agctaccaca 


tattcttact 


22860 




gaaagagcct 


ccctgggtac 


ttggactcca 


acactcttca 


tgctgatcat 


gtaaaaactg 


22920 


50 


cagtgttgca 


tgtcctttcc 


cagcttcctc 


tttcccaggg 


actttcatat 


cttgactgtt 


22980 




tctcttgcag 


tactcttgga 


gcttttctcc 


accaacaact 


tgcttttttt 


gttttttgtg 


23040 


55 


tttttttttg 


catttcaaga 


cacatcactt 


tttgtttatt 


ttgttatata 


aattatttgc 


23100 


tttcaaatcc 


attgtttttc 


atgcataaat 


gcatggccat 


gtctatttct 


tttagttaaa 


23160 




ttgtctgtct 


ggtcatgcac 


ttcatccatt 


tatatttttt 


tctgtctttc 


tttctcttgt 


23220 


60 


gccttattat 


agatgttata 


aagatttatc 


ccttgttata 


atttggcttt 


gccttttgaa 


23280 



-206- 



WO 02/062946 

ttgtttatca gatttttatg ttaatttaaa 
ttttatatgt gtagaaattg cttccaaact 
5 tttcttattt ttttaatcac ttggcatttt 
atagtataag ttatagtata agtaatatga 
gaaataaaat tccaagttcc ccaatcaact 

10 

caaaattaac ctgaaaaact agttcaggcc 
attatacaat cctttctttg gaattcaggc 
15 gacctcaaga ttgatggaac agtctcttta 
acgctgaagc ctgctaccag gagtcttcaa 
ttaagccaga cactcccttc tattgattcc 

20 

gccagccaga aaatgtttcg agctacttat 
ttggggcata gcttggtttt atacatttta 
25 aagaggtaca ttggttctgc acagaaaggc 
tggttatagg taggtaagag acaaatggct 
cataggaggc aatcacatct gcattaattc 

30 

ctgcccttgt ccacaaggaa attccttgtg 
tctttatttt aggaatagaa tgggaggaag 
35 ttccttttgg cttagtgact tgggggtacc 
tacttttgag tttacagtac acacacacac 
ccacttgttc tattattact tactgagtta 

40 

tttttaaaaa tacactcaac atttatttac 
catttttttg tcattcagtc tatttcaaca 
45 gcattttaat ttctgacagt acaatcctta 
tgggcctggt gcggtggctc atgcctgtat 
gatcacctga gatcaggagt tcaagaccag 

50 

ataaaaatac aaacattagc cgggcatggt 
aggctgaggc aagagaatca cttgaaccta 
55 caccactgca ctctagcctg ggtggcagaa 
aaaataaaat gcttttggct agtaagactc 
aagttccaca agggttttat ttggaagcac 

60 

aattgacaac cagaaatatg tcagtacagt 
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aatgctgaaa tttttggata 


attaaacata 


23340 


ctaaatgaca acaaggaaac 


agacatcttt 


23400 


cttatttata gtataagtat 


agttatagtt 


23460 


ggcaggacct aatgtatata 


ttttaaatca 


23520 


gatggaccct cctcttggcc 


aagggcattc 


23580 


atgatggaaa gtgggggtta gacacgccgc 


23640 


acagcttacc agcattaaca ttaaaacaga 


23700 


agtccaatag gaaacattta 


caatctattc 


23760 


taagaacttt ggtctccaca 


attccttatc 


23820 


aggtctttag ataaactctt 


tcaatcaatt 


23880 


gatctgtaac ccccatctcc 


agcccccact 


23940 


gggagacatg agatatcaat 


caatatttgt 


24000 


aggacaactc aaagctggga ggggtcttcc 


24060 


gcattctttt gagtttctga ttagcctttc 


24120 


agtgagcaga ggcatgactt 


tgagttctgt 


24180 


agagaggtat gtggcttatc 


ttagtaggta 


24240 


gtttgcccta agaaagtccc 


tgcttgactt 


24300 


aatatatttt cctttcacat 


ttctctcaga 


24360 


acacaaacac acacacacac 


gcacatttaa 


24420 


actgtgtttc tgtctatggt 


ttaaaatata 


24480 


attcattgta gtactacatt 


tgtctagtct 


24540 


caagtaacac atcacttgta 


ttagtacact 


24600 


cttcaacact actctttatt 


aaaatgattt 


24660 


atctagcact ttgggaggcc 


aaggtgggca 


24720 


cctggccaac atggtgaaac gccatctcta 


24780 


ggtgcgtgcc tgtaatccca gctactcagg 


24840 


ggaggtggac gttgcagtga gccgagatca 


24900 


tgagactcca tctcaaaaat 


ataaataaat 


24960 


attccacatg aacataaaaa 


tcattaagtc 


25020 


attatattac atctgaaaaa 


aatttggaat 


25080 


caagtgtcca tttgccttcc 


taaataaaaa 


25140 
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5 



15 



atgtgtatct 


tttatcatat 


aaattcttca 


tcatattttg 


ttttaatatg 


aacactttag 


tctataacta 


ttcattgggc 


ttgaaattaa 


ttacttctag 


ttttattgca 


attgattagt 


tttggaatgg 


atatatggcc 


aattgttgcc 


ttctttagtg 


ttgattcatt 


caagtatttt 


tcatcaaaac 


ttatatttgt 


taaaaatttt 


tactcaggca 


cattttttat 


tagtagtata 



acatattatt ataaatgaat tcctttgtca 
20 gcctaatatt aatacaccat cctttgtctt 
ctttccttac ccttacattc actactataa 
gtcactgtgc tgcttatatc cttctgtggc 

25 

tctccaggaa gcattcaaag ccctgtaaat 
ctctcccaac tgaatctgtc acttctccct 
30 ggtctgctag taagccaaac tgtttcagaa 
gagaatgcat atggctacaa gcacctgctg 
ttcaatggag cctcctcttc caagagaagt 

35 

ctccctatgt actgtgtgac ctttgtgctc 
ggcagggaca ttatcttcca caggtattgt 



acttaataaa 


tattggctga 


ataaaataat 


ccaggataac 


tctagaaagg 


tgggacaaaa 


gtcacttagg 


agtccctccc 


gtgtgaagca 


aacatgcctt 


catggaaaga 


gagtaacagc 


gagcaaaata 


agatgacaga 


gcaatagtag 


cagtgtatta 


gcatatttct 


tttgcacaat 


agcctgtgat 


gtattcatca 


agactcctca 


aactgtttgt 


ctaagtattc 


attaggaaga 


ctttttactt 


ggttgtgtca 


gtgtttttca 


ctctctccaa 


atcatttttc 


cattattttt 


cctcctcatt 


ttccatcctt 


catttccagg 



tgttttcaat taggtgaatt accaaatatt 25200 

atgtttaacc tggttattaa caattaaaaa 252 60 

tcttactttg gtaacaaata tttgtattat 25320 

taacatggcc tgcaattttt gttttattta 25380 

aatgtaccaa ggattcttaa aaagcatgct 25440 

ataattcaaa tcatattttt tggcttacat 25500 

catgattact atgtttttgt tagtttctca 25560 

ttatattatt tgatcataaa attgtaaaca 25620 

tatttaagaa ttttccttag acattttatt 25680 

attattttgc attttctgac ctttatttgc 25740 

taggaacctt tggaatagaa atgggatttt 25800 

tacacattgt ctctatgaca aagcccaatg 25860 

ctctcatttt gcacatccag gatcttatct 25920 

ctgtgataga ataataccaa agtgtttcca 25980 

ctctagtctc tgcacattga tccctctacc 26040 

ttctttgaaa gcatgatttg aaagttaatt 26100 

ttaatatctg acctttgtgc tcccatgatg 26160 

ccatgatgct ccctatgtac tgtgttcaag 26220 

tagtctctag cagtgttgca tgcaccagat 26280 

tctgaatggt agctcatttg gactagaaac 26340 

atggcagtct tgttcacata tacagtggct 26400 

gagaattgtt gctgtacaaa tctgttttca 26460 

aacatggaaa aatatgtgaa tttggaacaa 26520 

acagtgtaat tacttgatat caatgctagg 26580 

tgggtgcaca atagtagcat taaatgccgc 26640 

actcagtatc aatttcctac agcctagaaa 26700 

gccacaattt ttggtgggag gtgagctaaa 26760 

tatattaatc cagataaaga tttctattgc 26820 

ccaaagaatg gcctgctttc ttaatttttc 26880 

gccaagagta aagggttttg atcatttact 26940 
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gatattgttt gtctgtgtcc ccacccaaat 
cacgtttcgt gggaggaacc cagtgggagg 
5 cactgttctc ctgatagtga atgagtctca 
tccctgcaca atctctcttc ttttgtctgc 
ccatgtttgt gaggcctctc cagacatgtg 

10 

tgtaaattgc ctagtctcag gtatgttttt 
acctatctaa attgcaatct gaattggaga 
15 attttttgtg tttctaaatg gaaatggagg 
agaaaatcta aacaaaatat gtaggctaat 
ttcaaaaaat aggaactact ggtttagatg 

20 

tcctgtgtac acctgtgcag tgtattagct 
taggcctatt tcccacctct tagcaaaaat 
25 ttatcatcct gcagtttgga agaatagtgt 
aatcattctg acgaggccat gtaaggttat 
aaagtggggg tgaaaagcac tgattgactt 

30 

ggcggactca gtcctcagag ttacatggat 
gaggaaaatg gaagatggat tctgggcagg 
35 gtctgcccta tgcccaaaaa cactttcttc 
acatgatagg cattcagctt ttcccaagag 
cactgcattg gaatatgaag ggagctctgt 

40 

atgtgtttct tggccagaca attgagccac 
tggtccattg ttctctacaa ccacatccag 
45 agcccactta atatttggta caaatttgag 
aatccatagg gcaattcttg ggtcttccac 
ttgtcagtct aatagaataa attacaaaaa 

50 

gaaaaatata ccatgaaaca tttaaaattt 
acagaatact atttaaaata aatattaaat 
55 atacatataa aattaaataa atgactaaat 
tttgtgatac taaaaaaaag acaattgaaa 
aaaagggaat gctcacaaat ttattttaaa 

60 

aagaaatggg aaaattttca gatatttttc 
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ctcatcttga 


attgtaactg 


ccacaattcc 


27000 


tgattgaatt 


atgggggcgg gtctttcctg 


27060 


tgagatgtga 


tggttttaaa 


aatgggagtt 


27120 


ctccttgtga 


gatgtgcctt 


ccactttctg 


27180 


gaactgtaag 


tccaatacac 


ctttttcttt 


27240 


acagcagtgt 


gaaaatggac 


taatacattt 


27300 


aacagtggtc 


tttgtgacat 


taaataaatt 


27360 


atgaaaaggg 


agaagtacat 


tatcatacag 


27420 


gtatctgaaa 


actacagagt 


tgaagcaaac 


27480 


tttagatgtc 


atggagactc 


cagcccatac 


27540 


ctaccctcta 


ctgtgtgttg gcttaatgct 


27600 


ttccacatca 


agtattggcc 


ttacatccat 


27660 


tgtctgaagt 


ttcaaaacaa 


gactattgat 


27720 


atgtccacat 


tggttcaatc 


attgcaatgg 


27780 


agcctagagc 


cagagcacag 


tggaggcaag 


27840 


ccccagtcag 


aattcacaga 


ctgtttgaaa 


27900 


caacaagcgc 


atgctacagg 


tcactccttt 


27960 


tgatccatac 


ttccaacaat 


gatcacagct 


28020 


gacatcaagc 


cagaaatctc 


atgcacttaa 


28080 


ctaaacaggg 


aactctgtct 


caatagtaac 


28140 


ctccaggtat 


gttctccagg 


aagattgcct 


28200 


catgggaact 


aaaaaattgc 


agcttctagg 


28260 


ggcctagatg 


ccaatcatct 


atagttgttt 


28320 


caaggaataa 


gaaggaaatt 


aaacaaaatt 


28380 


tttaaaaaat 


taaagcatgg 


tgagtagaaa 


28440 


gatatattag 


taatcaaaat 


aaaattgaat 


28500 


atattaataa ttagaagaca tatttaaaat 


28560 


aaataaaaca 


ctatgccctg 


agaagtgagt 




tattattcaa 


aatattcaga 


agattttttt 


28680 


agatcttgat 


ctaaaccaaa 


ccagagcaat 


28740 


aatctctgaa 


aaatcatatt 


taaaacacaa 


28800 
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ggaaatataa tcccacagtg taacatatat 
tatgggtgtt ttaataatat ttcactgcgg 

5 

atcctgtaga acttagtgag ggagcagggc 
ccctcacagt gacctgaaaa cagccaccac 
10 ccaggtatct gccatgcaag gccacctaag 
tatggtttac agacagtccc acccctgaag 
actttggtgg actcagacaa aggcactgcc 

15 

atgcagcaac acagctaggc ctttaacaca 
atgaaactaa taaaactcct gggcataatt 
20 atgattgtca agattttgac cttttttttt 
cccaaaagag cagcaatttg tagacctatt 
cgtagatatc ctgaaagtat tcatctactg 

25 

ggaccccgac caacaggaca ctcaggagga 
cctggacccc attgactcca tttgctgttt 
30 actgagtgga gcctatggca aaagaacaca 
aactccccac cccttaagtc tgggctgtga 
gtaaagggga ataaaaagag tagatttaca 

35 

caagtgacta agataacagt ggtaagtcat 
atgatactgg ctctttacat ctgtgatctc 
40 tgagaaaaat atcagataaa ccccaactga 
tcctcaaaac tcttaacgtt ataaaacaca 
aggggcctaa ggagatggac acctgactgg 

45 

agaaaaagga cattagatca aaatctaagg 
tatcaatgtt tgttcattaa tggtgtatca 
50 actgagtgca gggcacagga gaactctctg 
acttttctag tataaaagtt tatttcaaat 
tattctgcta aaccatgttg tccaacagag 

55 

aaataagatg aactctccaa ttttacatta 
tacacagttt ggtgtaaatt ctcgttcaat 
60 acttaaactc ctttactttg ggagttaagc 
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cattcccaag 


gtgtaattat 


actgagaatg 


28860 


acttcagagg 


ttagcatgat 


aaagaaggca 


28920 


accacgtggc 


cacatcactg 


tagaccctgt 


28980 


caatacagca 


gagcgcacgc 


acctaagctc 


29040 


cccctagcta 


tctgccatgc 


agagatgggt 


29100 


agagctctgt 


ccttctgcta 


tactgatggt 


29160 


caacaaggac 


tagacaccct 


gttgatccac 


29220 


gacaagatac 


aaggacctaa 


tgatacccaa 


29280 


cagaaaggag 


caaagcctct 


cattgcaaag 


29340 


ccctccccac 


tccctcaaaa 


aaaaaaaacc 


29400 


tggctactgg 


cagcaatccc 


aaatacccca 


29460 


gcttacaata 


aaggccacta 


ctatggagtt 


29520 


cctccaaagg 


gccacccacc 


ccacccttct 


29580 


gaatcacagg 


tctcagcaac 


ctctcagttc 


29640 


caggccatcg 


ggctttggat 


tcataaacgt 


29700 


atagtgactt 


cctttcgaag 


aacacagaat 


29760 


gtgaagaagc 


ctgacataca 


ctacctcaat 


29820 


gttgatagtg 


tggactgttg 


atgggatgtg 


29880 


cttccacaaa 


acccaaacac 


cagtctaatc 


29940 


gggagtttct 


acaaaacacc 


tgaccagtac 


30000 


aggaaagtat 


gaaaaactgc 


cacagtctag 


30060 


aatgctgcac 


cttggatggg 


atcctggaac 


30120 


aaatctgaat 


aggacagtat 


gaacgtcagt 


30180 


tactaatgta 


agatgttcat 


aattggagaa 


30240 


ccatcttcac 


aactcttctg 


taaatctaaa 


30300 


tgtttaattt 


ctcttttgtt 


tttcactctg 


30360 


ctcactaaaa 


tggcacatgg 


tgaattcaca 


30420 


tcaaccatat 


acacacacaa 


tttgtataca 


30480 


accaggttcc 


caggaagaaa 


taaaataaaa 


30540 


aaacactctc 


aaaattattc 


tgcccccatt 


30600 
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tttctgtttt gctacatttt gttcagagat 
attttttttt tattggcatt attccaactt 
5 agagaagttg aaaccaaaat caagcccctt 
tagacctcat acttaatcta gacctatgat 
ggagtatcta aagaaaggag ggtagatgga 

10 

gatgacaata aactccagat ttccagaatc 
aattcaggca ccttttcatg ctgctttcat 
15 tatgttccct ctcagagcat tagagcctta 
ctctgtaaaa tatttgaaga gatttattct 
cacagccttc aggaggtcct gagaacatgt 

20 

tatacattct agggaggcat gagacatcaa 
gtccagaaag gtgggacaat ttgaagcagg 
25 aggtaaattt aaacattttc tggttgataa 
aatagaaagg aatgtctggg ttgagataag 
gatgaagcct tcaggtagca ggcttcagag 

30 

cttttttttt tctttttgag atggagtctc 
acaatctggg ctcactgcaa cgtccacatc 
35 tcccaagtag ctgcgactac aggtgcatgc 
aaggtttcac catgttggcc aggctggtct 
cttggcctcc caaagtactg ggattacagg 

40 

aaatgtttct tatcagactt aaagtctgtg 
gcatgtctca accacacttc ccatcatggc 
45 agagccctgg ctgaggagga agtccattca 
gtttacattg tggtactttt gcaaagtttg 
tcactcttga ggcacttctg tcccattggc 

50 

tctctggtct ctgtgttcct ggcatcctca 
acaaaagtcc tttgccacag tagctatggc 
55 ttctaaagca aaccttacag gcacaatttt 
gaacacctct gagagaaagg ctgaaccaaa 
ccttggaggc tcaggacagc tagtcacggt 

60 

ctcctcagcc tttgaactgt caggcctgag 
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gtttccttca 


gattctgcat 


ggggttgggg 


30660 


ttcttagtac 


ccgtttcaca 


tttgcaaaat 


30720 


aaacaattag 


cccagggaaa 


attacgtttc 


30780 


attatttcca 


aacaccaaag 


cagtaactca 


30840 


gaacaagaaa 


ccttgatttt 


cagcctcaag 


30900 


aatgatttgt 


cttaaaatca 


gttgacgctg 


30960 


tggaggatgg 


agaaatccac 


gtgtgcatat 


31020 


gagacattat 


tggtgataaa 


aagtgtcaaa 


31080 


gagcctaata 


tgagfcgacca 


tgacccacga 


31140 


gctcaaggct 


gttggggtgc 


agcttgcttt 


31200 


tcttatacat 


ttaaaatata 


tattggtttg 


31260 


aagtgagtga 


tggggcagct 


tccgggctat 


31320 


ttggtcgaat 


ttatttaaag 


acctggaatc 


31380 


aggctatgga 


gaccaaagtt 


ttatcatgca 


31440 


ataattggtt 


gtaaaatatt 


tcttttcttt 


31500 


cctctgtcac 


ccaggctgaa 


atgcagtggc 


31560 


ctgagttaaa 


gcgattctcc 


tgcctcagcc 


31620 


catcacacct 


cgctaatttt 


tagtagagac 


31680 


ccaactcctg 


acctaaagtg 


atccacatgc 


31740 


cgtgagccac 


cactcccagc 


catggctgta 


31800 


ttgatgttaa 


tgctggagag 


gtataaagag 


31860 


ctgaaccagt 


ctttcaggtt 


acattttaaa 


31920 


gatacttggg 


gccttagaat 


tttatttttg 


31980 


gatatttgat 


ggtggtccca 


ggtgtcaaag 


32040 


cttgctcaaa 


acctcctgat 


gtttttttcc 


32100 


tgttgcatca 


agtgaaatct 


gtctctagtg 


32160 


cactatttct 


ttcaatgcac 


tcacttaggc 


32220 




LLdLLy LdctL 
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gctgctatga 


gggcctaggg 


agccagtggc 


32340 


gccactgtgc 


tccattctag 


aactgagcat 


32400 


tgtggtgctt 


catacgaaaa 


tgctgtagta 


32460 



-211- 



WO 02/062946 



PCT/US01/48437 



cttcagagaa acagaaatgt ttcttttctt 
tgaccctaga attctgcatc cttttaaagc 

5 

ctgctaacaa ttcatctgct tcttaaatgg 
gatgtggttc attgattaca ctatattctg 
10 gttttgcttt ctggcctgga catttttgtc 
aagatgctta gtgttcccaa agcatccatt 
cagtgtccca tgacccccct aaatgacata 

15 

ggacttgatt gttgacttca aaccacccaa 
tttcttggcc tctttgccag ctgaagacct 
20 cttgggccca ggcttactgc aggagatgcc 
tcttgcactc cagcacagat cccatggcca 
gggggtgtgt gagtaagcaa gtgtggggtt 

25 

gtgggctctg tgcagggctt gcagctggac 
actctggcat tcagacaaga ggaacatggt 
30 aagccccaga gggggtgttg ccacattcta 
cagtgaacag cggcatgtta acagctctgt 
tctggggcta gctcggtcct gctgctactt 

35 

gcgaagggga aagggccaca gtgttacagc 
agttattgtc ccacgttcaa gaagaatgag 
40 tggggaagaa ttttattaag tgacaaaaca 
gtcccccacc caaagtcagg tggtctctct 
gagtcccagg cttttatggg ctcagaatgg 

45 

gcagaaaagg ctaaaacaaa gacaccactc 
ttatggaagt gtaggtacat gtaaaacagg 
50 gagaaaagca tgccaaatgg gaagagaggt 
ttgtagcttg gctttcaggc tttaaactct 
cctgcccctg tctgcctacg attgtctacc 

55 

caagtacttg ctttaattta tttttcaaag 
gaactaaggg actaggaata taaataatac 
60 cttgcaaaat aagatacagc aaatgctcag 



tggagcattt 


atttctgcag 


gatttatata 


32520 


attcccacat 


tttggaaggc 


ttttgctaag 


32580 


atccatttcc 


atatctgttg 


atgtaaagtt 


32640 


ttggtttgtt 


ttttgtttgt 


ttgtttgttt 


32700 


ttcctaacat 


acctcttaca 


ttgattcttc 


32760 


gttcaaacaa 


ggcttgtcca 


cttcatctcc 


32820 


ctactgtgtt 


accagtgcgg 


aaccaacact 


32880 


tcaaggctac 


aacagatgta 


acacaggatt 


32940 


ccagccagtg 


acacccctgc 


ccaggctttg 


33000 


ctatctactc 


agcctggcag 


gccacacctg 


33060 


ctgcaactgt 


gcaataagcc 


tctggtggga 


33120 


gggccagcta 


tttcaatcac 


cggcacagga 


33180 


caggcatgtc 


acaagcaact 


cccgcagtgg 


33240 


ggtacccagg 


cagggatgcc 


catatccctg 


33300 


atagctcttt 


tagttctgcc 


atctgcagcc 


33360 


cagtcccatt 


gccctaatcc 


tgcccatggc 


33420 


cctgttgcat 


ggggtggctg 


ccctccactg 


33480 


tttttttgta 


ctcacattca 


gtgggtcctg 


33540 


gttatgttga 


caattgaagg 


gtgaggaggg 


33600 


actctcagca 


gagagggggt 


tacaaggatg 


33660 


cacccagagg 


tgggcagttc 


ccatgtggct 


33720 


aggagtgcag 


gctgattagt 


ttgtgagtat 


33780 


aaaagtgggc 


atgacaatgt 


aaaaggccaa 


33840 


ggaagggtgg 


tgaagggtgg 


ggaccaatca 


33900 


tctccatcca 


gcctcagatt 


tatctgagac 


33960 


ctttgggtta 


aaggtggtgt 


tcaccggact 


34020 


tcctggcgct 


gtcagatgca 


tcaatctccc 


34080 


ttcctagtct 


ttgaacatat 


gtttttagga 


34140 


ccctttcttc 


tagtcccata 


tttacagtga 


34200 


gtgaaaaacc 


aaaataccac 


tttattcctg 


34260 
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atatgctgaa ggaaaagcat tccttcagtt 
caaccagtca ctggcagagc tgggtcaccc 
5 aaaccctagc aagggaaggg aagaacagat 
caaaacaaat gaagaaacac agatgggctg 
ctctgcaact ttgtgcagca ttcctccttc 

10 

gaaacaggtg tgtttctcta taggaagttc 
atgaagttcc ctcctaactg tggctacctc 
15 gggagagctt tggggctacc tgcagcattt 
aaataatcat gtaaaatgat aatatcactt 
ttttattatt ttctgcattt ttatctatgt 

20 

ttataaatta ttctgagtca tttatcatct 
ggaatacaga aaagaaaaag aagaaaatat 
25 ttattattta atgtaccatt tttatacagc 
aatatttata gttgtttttc ctcatttgac 
cactgtagcc ccctcagaca acagtgagta 

30 

tcataaagtg caaaggaact ataccctcca 
gagatacttc atattaacca agcagtagat 
35 gagaaacctc atgaaaacat agcacatttt 
ttcctttccc ttctcacccc atgttcagac 
aagacctaag tttcaggttc ctgcaatgac 

40 

agtaagtatg gtggcagagt tgcagctggc 
gaagcaaagg aagcccccac gttcactcac 
45 ttctgccgca gcttcaaaag ccaaggtaga 
ggagcttagg gccacttaca gatcctgaaa 
ctgatatcca acccagcaga ccagacgtat 

50 

ttcaaacaca caaaaaaata tcttgagcaa 
cttttcagaa tgaagatagt taaatgtaat 
55 tatatatcca gcacttagaa caaagcctga 
cctttccttc ctcttccatc tttccctgcc 
agagctagac tacccttcca tagccagaca 

60 

ttccatagcc agacaccctc ctcacttcct 
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CyddattCaa 




34320 


Cdyycccccc 


CCdCaagagy 


dyy ^d^i>LUL 


34380 




r« i~ anr'a it f*r r* 
LLUayLdyy l 


acta t - n t" f t~ 0 


3444 0 


— i *-*f r-k o n ^ n 3 o 

dyCCaCaCaa 


rtrrTaanrrn 




34500 


cccgaagaga 


dCCdyytCdd 






cgccggaagg 


aaaugcgagg 


c t-yydydcdy 




tggggaagga 


ggg a 99ggg a 


ggagy ayaca 


I4£ftf> 


tatxtctgta 


gagaggaaaa 


aagcgaaccc 




gaatcactgg 


gggaaatatg 


uagauaccug 


"\Anr\C\ 
flUU 


gtataaagtt 


ttggaattag 


catataaaca 




caccttxcta 


cacacccgac 


tCttLCdyda 




aagtctccta 


cgaaccccaa 


_rrf- h^fhhfa 


J** Z70U 


4- _ A- _. *— _ _ 4- — i. 

tctgaaatgt 


ataaaagtcc 


cugaguggca 


1 c; n a n 


cttcttgcct 


4_ _ _, 1- — _ f. ki. *_ 

tcctcctttt 


4- 4- _ 4- _. 4. _ 1- 4- 4. 
ctgcgugtut 


lUU 


ctcttttcag 


gacaaaatgfc 


caggtgtgag 


O 3 lO U 


gaacgatggg 


aaggaaaact 


ttctgcctaa 




4- 4— _ ^ 4- _ 4- .m — 

txcttctcaa 


acaaatggaa 


agcaattgaa 




tgtxtcccaa 


cagttataaa 


caacccagcc 


1 t^i a n 


accatgagtg 


txtcccaaac 


ttCyyaaaCa 




_ 4- — 4- i- f-~, ^ 4- i~i -« 

ctgttgctga 


catxctacug 


cagccgucag 


j j4 0 u 


ttggaagatt 


4_~,4_4^^+.4---4-4- 

tatccccact 


cccaaggaca 


^ ccoft 


tgaaaataac 


aat tggccat 


ctgcatttgt 


J j juU 


gagcctgctg 


gcgcgggcaa 


*m ^ ^» ¥~ ^ 

gtcugytctc 




cccagggtca 


gcytayycLL 


ct y ct ct ay y l.l^ci 


J J ' uu 


gagcactccg 


CCCayaoyCC 




J J / Du 


cttcctcaac 


4- 4- 4- H f nrf^irft- 

Ctcctggggt 
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CttCCLCLCC 


JJOZ u 


agtaccccat 


ggcgacagga 


Cyaydaaata 


JJOOU 


caatgtaatc 


tcatcaagac 


ttaagagctt 


35940 


tcgcttccct 


agatgataga 


gctagatgat 


36000 


ccagagctag 


atgatagagc 


tagaccaccc 


36060 


ccctccaaca 


cttgcccaca 


ggcacaccta 


36120 
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gaatcctcaa gaccactcaa gtggcttcct 
gtaagatgtg gagggaatta tgcactcgat 

5 

catatgcttt cttatagctg ggggcacatg 
acacccttac ataagtgtgt gctgggggaa 
10 tttgtgtact tttcacaatt gcccagcaga 
ttctgtataa ttcagcccta tgtggcagtg 
tatataacag agcatagaca ggagaggcat 

15 

aagagatgac acagaattgt gccaaggggc 
aggtaattat tcaggtagaa tttagatctt 
20 tctgttacca atactgttta gaattgttta 
tgttattgtt gtaaaatttt ctacagacca 
accacaccca ccacccgttc ttgttttgcc 

25 

aataaaaccc cattcaccca agtagaaatt 
accagattat aaaacattaa agattgagta 
30 aagaggattt cattagaaac agaatgtcat 
aggacattgt tgatgttttg tgtggtttcc 
ggaacctccc cctcatctcc agctcagaga 

35 

atgttgtaga tcctctgcag tttagggaaa 
aaaccaaaga atgctaagac ctggactgtc 
40 tatataataa atcacccaaa cgtagaggtt 
gggactctgg gcatgactta actgagttct 
aggtattggc agggatgagg cctcaaatga 

45 

cttacaatgt tattgctaga attcagttcc 
ttgccaggtg ttggctcgag gtctttgtag 
50 aagggctgct tatactctct taccatatca 
ggccaacaat ggtgagaatg agacaccagt 
atcatataat cacataccca aaatcacata 

55 

ttcaaagcaa gtcacagatc ctgcctatat 
agaccaagag gcaggaatca tctagccacc 
60 gcaagaggag tagattgtgt gggggtggtc 



ggtaccactc 


ttgctagacc 


aaagaaaatg 


36180 


tcttcatatc 


ttcgatagtg 


ttaaatggtc 


36240 


catttacaca 


aacacatgca 


cacacacaaa 


36300 


agctatgcaa 


ctctgaggct 


ccaaccactt 


36360 


tggcagcaaa 


gtgtcctgag 


gaactacttt 


36420 


taaagctaga 


ggctaaattg 


tccctctaga 


36480 


tctgttcagt 


ttcatctgtg 


gcccgaagac 


36540 


aggacagaag 


gaggagtcag 


cccccggagc 


36600 


tttcgttgtt 


gttgctcctg 


agagcagtat 


36660 


aaaatggatg 


ctaaaaagca 


tatggcagtt 


36720 


tgcttctcct 


cattgtctgc 


acctgaagag 


36780 


atagagatga 


caatcgctcc 


taatttacac 


36840 


aagaaaaaag 


agccaaacat 


ggttttctca 


36900 


aaagagtaag 


aaattgagta 


agactgaagg 


36960 


ttggaaactg 


tactggagaa 


tttcgttaga 


37020 


tccctcctgt 


cactatagga 


gaaggaagga 


37080 


gagaggctgg 


aatgggaccg 


catggggatc 


37140 


atgagaaact 


tgagtccaac 


acaccaggag 


37200 


tactctatag 


ccagtgtttt 


tatctattgc 


37260 


taacacagca 


cacctgttac 


ctcacaggtc 


37320 


ttgcttcaag 


gtctcaccaa 


gttccattta 


37380 


ggtttgacta 


gagaaggatc 


cacttccaag 


37440 


ttgcagttgt 


aggataaatg 


gtctcatttc 


37500 


gatattggct 


agagaccact 


ctcaattcct 


37560 


agactgcaac 


atgacaaatt 


ggtttctcaa 


37620 


aagacagagt 


ttacagtctt 


ctgtgacatg 


37680 


catcctgtca 


cctttgcagg 


attctggtta 


37740 


ttacagggaa 


ataactacaa 


aagggcatga 


37800 


ctagagtggc 


tctcatggtg 


gcacaattaa 


37860 


tgcacttcca 


cctgtatgct 


gtgtgaaaga 


37920 
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gattctacat 


ggatggtctt 


agagcttgct 


ctgaaaagaa 


atcaggaggg 


agcaaggaga 


37980 




ccttgaagag 


aatgaggctg 


gagattttct 


gatggacatc 


cagggacagg 


acataggata 


38040 


5 


atcgtcagct 


gccagctaag 


tccatgagga 


tactgagagg 


aagcccagcc 


aaggaatccc 


38100 




agtgatggaa 


ccacagatgc 


attcaacatc 


caccagcctg 


catagcactg 


acatcgagtt 


38160 


10 


gggaaacggc 


caatcgagag 


agactttttc 


tgttcctctc 


tcctcattcc 


tgcccctgac 


38220 


tctgtgagga 


tcaggaatcc 


ctcttagcaa 


tcctcagggt 


gaagagttag 


agaagaaata 


38280 




gcactgaccc 


ccactttaca 


tgtcagctgc 


cctgcctaaa 


gcaggccaaa 


ctgaggaaaa 


38340 


15 


gaagaaattt 


taatatgaat 


gacagttgga 


ttttgatttt 


tgcataaatc 


tatattttaa 


38400 




ttactaaagt 


gactgaggaa 


ctcttctatt 


agctgacagt 


agtaacatct 


tagatttcaa 


38460 


20 


acaagcagca 


ggggagagct 


agccctactg 


aggatcttag 


aaagaacagt 


tgtggggcca 


38520 




ctcatgcctg 


taatcccagc 


actttgggag 


gccgtggcgg 


acgaatcgcc 


38580 




tgagttcagg 


agttcgtgat 


cagcctggcc 


aacatggtga 


aaccctgtct 


ttactaaaaa 


38640 


25 


taaaaaaatt 


agctgggcat 


ggtagtaggc 


gcgtgtaatc 


ccagctactc 


agaaggctga 


38700 




ggcaggagaa 


tcaccaggct 


ggtctcaaac 


tcctga 






38736 



30 <210> 79 

<211> 104644 

<212> DNA 

35 

<213> homo sapiens 



40 <220> 

<221> genomic__DNA 

<222> (1) . . (104644) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 



45 



50 



55 



60 



<220> 

<221> MS4A7_initial_coding_region 

<222> (17493) . . (17639) 

<220> 

<221> MS4A12_initial_coding_region 
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<220> 

<221> MS4A5_initial_coding_region 

<222> (64028) . . (64180) 

<220> 

<221> MS4A7_coding_region 

<222> (19439) . . (19573) 

<220> 

<221> MS4A5_coding_region 

<222> (65149) . . (65277) 

<220> 

<221> MS4A5_coding_region 

<222> (66764) . . (66820) 

<220> 

<221> MS4A5_coding_region 

<222> (68118) . . (68270) 

<220> 

<221> MS4A5_coding_region 

<222> (82002) . . (82109) 

<220> 

<221> MS4A12_coding_region 

<222> (95210) . . (95347) 
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<220> 

<221> MS4A12_coding_region 

<222> (96148) . . (96204) 

<220> 

<221> MS4A12_coding_region 

<222> (97865) . . (97981) 

<220> 

<221> MS4A12_coding_region 

<222> (100890) . . (101000) 

<220> 

<221> MS4A12_coding_region 

<222> (101182) . . (101283) 

<220> 

<221> MS4A7_coding_region 

<222> (21068) . . (21124) 

<220> 

<221> MS4A7_coding_region 

<222> (23741) . . (23947) 

<220> 

<221> MS4A7_coding_region 

<222> (27037) . . (27138) 

<220> 

< 2 2 1 > MS 4 A7 _c od i ng_r e g i on 
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5 


<400> 79 








nnncttgagt 


caggatgtgg 


cttttcctta 




agagaaaggn 


tgntacggag 


agaagcaaag 


10 


ccaaggtttg 


tgggaaaagt 


ggntacagtt 




gagagaagca 


cagggtctaa 


gcccatgtgc 


15 


tngaaggcat 


atgcttgtca 


tattgaacaa 




tgtcaattga 


aagcttattt 


tatgttcgcg 




tncagtttgc 


ttggcctttt 


aaagtgctgg 


20 


tgaaactgct 


tttttgtgct 


gagtcagttc 




agtttattga 


tctgggtggt 


gccagctaat 


25 


tgctcttagg 


agcagtttag 


gaagggtcag 




aaaccataat 


ttctaatctt 


gtgtctaatg 




gcaagaaaga 


ggtctgcttt 


gggaaagggc 


30 


taaactaagt 


ttctcccaaa 


gttagttcag 




ggaggttaga 


agcaaaatgg 


agtcagttaa 


35 


ttttgcaaag 


gcggtttcat 


atcggattca 




taaagaggaa 


ggcattttgt 


atcaataaaa 




tgagacggag 


tctcgctgtg 


tcgcccaagc 


40 


gcaagctccg 


cctcccgggt 


tcaggccatt 




ctaaaggcac 


ccgccaccgt 


gcccggctaa 


45 


gtttcaccca 


tgttagccag 


gatggaattt 


attttaacca 


acgttaataa 


tctaaatacc 




taacaaaata 


ttatatatac 


cccataaata 


50 


atctcaaatc 


acatgtgtat 


tgggtcagat 




aagatcttac 


acttgtgtta 


aagaacaaat 


55 


aaattgaaaa 


gctgagttta 


tgtgtttgtg 


ctctttccct 


gtggggaaat 


tcattaagta 




ttcaaactga 


atagaaagag 


gtttcataat 


60 


agtcttcagt 


ctttaggctg 


ggaaagtgag 



aatcacttnt 


atgggtgtgt 


ggcagggaag 


60 


gctttatagc 


tgcttcttcc 


ctttctcctt 


120 


tgataatatt 


tctgtttgtg 


aatgggtgga 


180 


ttttgcttca 


tgccttgtca 


nttaggaaaa 


240 


atattgattg 


tcaattcaag 


atcaattgct 


300 


caggtcttaa 


tagataggtt 


ttccaggtga 


360 


gattataaac 


atgagccacc 


tcacccaccc 


420 


ctaggtgggg 


gccacaagat 


cagatgagcc 


480 


ccatcggggt 


ctgcaaaata 


tctcaagcac 


540 


aatcttgtag 


cctccagctg 


catgactcct 


600 


ttagtcctac 


aaaagtaatc 


tagtccccag 


660 


tgttactgtc 


tttgtttaaa 


ctgtaaacta 


720 


cctactccca 


ggaatgaaca 


aggacagctt 


780 


gttaaatgtc 


tttcactgtc 


tcagtcataa 


840 


gtgatgaaaa 


gattttatct 


gaacaccact 


900 


taatttttca 


aaaaaaaatt 


tttttttttt 


960 


tggagtgtag 


tggcgcggtc 


tcggcttact 


1020 


ctcctgcctc 


agcctccaga 


gtagctggga 


1080 


ttttttgtgt 


gtgtttttag 


tagagacggg 


1140 


ttttaaattt 


taaccaacat 


tatttttaaa 


1200 


ctgacttgat 


cattacacat 


tctatgcatg 


1260 


tgtacaaaga 


tcatgtatca 


attaaaaaga 


1320 


gttggtaaaa 


tctgaatccc 


attttctgac 


1380 


taaagatcac 


tagagctata 


aatagaatat 


1440 


tatgtgccca 


gtaaggaaag 


agccccgtgg 


1500 


ttcaccagct 


gggggaaatt 


caggggtttt 


1560 


agttatgcaa 


attgaagact 


gaagactagc 


1620 


ttgtttaaaa 


caagtcctgt 


tgtttataat 


1680 
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cacttaccag tttcggtcaa ctgaatcagc 
tgtttcctaa aaatattttc cttttatact 
5 aaaacttgag caatacattt tgttttctaa 
tgtgtgtatg tgtaaaaatg aaaaataaaa 
aaaaagatgg caaagttgtt tcattcccat 

10 

ttagaaattt gttcaagtca taaaggtatc 
taggccaact aagaattcct aagcctagct 
15 caggtcttgt aactcagctc atacacgacc 
acaaattagg ctaacagcag gaagtaaaga 
cagtgggagg gacaatgatc gggatataaa 

20 

cctttgagtc ccctcccttg taagggagct 
attgctcttc tggtccgtgt ttgttctggc 
25 tgcagtttgc cgccgtcgca gaaccaccgc 
gtcccctgcg ctcctgatcc agcgaggcgc 
tgcacggcta agtgcccggg ttcgtcctaa 

30 

agttctcttc cgtcatccac ggcttccaat 
cttccattcc ttggaatccg tgaggccaag 
35 gccatctttg gagcagccca ccaccatctt 
aatctttgag taagttcaca attgggttat 
gataaatcca cacactccaa tagaaattat 

40 

gttaggcatg gtgtccccat ccaaatctca 
agagacctag tgggaggtga ttagattatg 
45 atcatgagtg aattctcata agatctgatg 
tcacacactg tctcctgcca ccatgtgaag 
atgattgtgt tgtaaatttc ctgaggcttc 

50 " . 

acctctttcc tttataaatt acacagtctc 
agtaatataa tagtcaacat aatttgtctt 
55 aaaagaatga ttactttatt gagaagtggg 
agccttcttt gtatataaaa atggcagtgt 
gaattttgct ggaatttgtg ggaatgggct 

60 

cccttctact gctctcctct caccaccgtc 
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tgcagtgaga 


aaaaacattc 


agttttgatt 


1740 


caaaattagt 


tgctaaccaa 


aattaaagaa 


1800 


atgccactaa 


aattttatat 


gatataggag 


1860 


tgtttccaaa 


tgcgttttag 


taatatcaac 


1920 


tctctgctac 


catctggttt 


aagaaaatac 


1980 


attttgagag 


acaggactag 


ctggatttcc 


2040 


ggagaaggtg 


acggtaccca 


cctttaaaca 


2100 


aatcaggtag 


taaagagggc 


tcactaaaat 


2160 


aatagtcaaa 


tcatatattg 


cctgagagca 


2220 


cccaggcatt 


cgagcaggga 


gcggcaaccc 


2280 


ctgttttcat 


tctattaaat 


cttgcaactg 


2340 


tggagcagag 


ctatcgcttg 


cgtccaccac 


2400 


tgacttctac 


ccctccggat 


ctggcagggt 


2460 


ccattgcccc 


taaaggctcg 


ctattgttcc 


2520 


tcaggctgaa 


cactagtcac 


caggttccac 


2580 


agagctataa 


cactcacagc 


atggcccaag 


2640 


aaccccaggt 


cagagaacaa 


aaggcttgct 


2700 


gggagctcta 


agaacaaata 


ttttcagata 


2760 


ctgaaatatg 


caaggaatcc 


tgttcaccaa 


2820 


cttgcaaatt 


caatgtcgat 


agttgacatg 


2880 


tcttgaattg 


taatccccaa 


gtgtttaggg 


2940 


gggacagttt 


cccctatgcc 


cattcttgtg 


3000 


gttttctaag 


cagtagtttt 


tcctgcactc 


3060 


aagtcccttg 


cttccccttt 


gccttccgcc 


3120 


cccagccacg 


tggaactatg 


ggtcaattaa 


3180 


aggtatttct 


ttagagcagt 


gtgaaaacag 


3240 


tttattatct 


gatctcccca 


tttttcttat 


3300 




dLtaagagat 


s*i a t"a h t* t~ 
^aLdLL LvLv 


J J D u 


gacatagctc 


tcgctaatga 


gacctaagaa 


3420 


aatttcttat 


ttaggcatag 


cccttcctct 


3480 


cttcttttcc 


ttcttgttct 


tcatagtctt 


3540 
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tcaaaagaaa aggcaaatag aaagctgaca 
cccatggaga tgaaagctca ccttatggag 

5 

acactgatga tactgaaatg gccattgcaa 
agagatctga cctaaccaac ttcaccttgc 
10 ctgggcctag gccaaactaa ctttgggagg 
tatctccttt ccaaaacaag accccttcct 
aaccaattag ccacacaatt agaaattatg 

15 

agttctcacc ctccccagtc gctcctgggg 
gtgcttcgga tattttgcag accctgcaat 
20 taaactggct catctgggtc ctgtggcccc 
agctttgatt ccctgtgatt ttatcttcga 
ccccttactc accaaattat ccttaaaaaa 

25 

ttgagtaata ataaaagtct agagggtctc 
ctctctatgg caattcccct gtcttgataa 
30 aacccactga gcggttacaa tatcgtaggt 
gacttcttga aaaagcctgc atttctaact 
tcaaattgat atactaatct aggtattttt 

35 

taaacttcaa gttattctaa tattaaaagt 
ctcagtcacc tttccccacc atttagaatg 
40 accttaaaaa tctcctggtt ttattgattt 
acagaaagcc atagtgactc acccagattt 
gaccaacatt gtttcttata tgctgtgttg 

45 

ccatttattg tcaatactca gcagaactat 
aatggaatat gtatcagcat tgcaaacatg 
50 tttcatcagc ttggattctc aaggtctcat 
gtcttatggt gatgctggac atcagtgtaa 
attatcgggg aagccaaagg tttacctagg 

55 

ctcagtttgt ttttgtgaag ccttcgtgat 
ccggattcgg tttctagctt gtacaattaa 
60 ttttgtcagc agagtattat ttgtattggg 
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atggattatc 


caacttgtta 


ccatacagct 


3600 


aactgagcac 


aaaaacctga 


gaatctgaaa 


3660 


aattctggct 


gagagcatta 


ttacagtgaa 


3720 


ttctaacctc 


caagccatcc 


ttgttcattc 


3780 


aacttaattt 


acagtttgaa 


acaaagacaa 


3840 


gcctggggtc 


tagactgcct 


ttgtaggagt 


3900 


gtttaggagt 


catatagctg 


gaggctgcga 


3960 


ataacatcac 


tattgtaaaa 


cctaagatca 


4020 


tgatggatca 


gctggtacca 


cccacattaa 


4080 


cactcagaaa 


ctgactaagt 


gcaagagggc 


4140 


cccaagcaat 


cagcactccc 


aactcactgg 


4200 


cctaatccct 


gaatctttgg 


ggagactgat 


4260 


ccatagagtc 


agctgtgcat 


gaattaaact 


4320 


actggctctg 


tctaggaact 


gggcagagag 


4380 


cttgttaatt 


agctctagac 


agttt'cccta 


4440 


ggtgaagctt 


tggcattaag 


gaatgcaatc 


4500 


agaaataatg 


taattgtaca 


aatgagcaaa 


4560 


gaggttaaaa 


ttgttttcac 


taataattac 


4620 


caccatttgt 


ttaatgctat 


agctgggatt 


4680 


ctagtattta 


atgggtgaag 


aaattaatct 


4740 


atgattcaga 


cttagagttt 


ataagtccaa 


4800 


cctacagggg 


tccatagtac 


ctgttctcta 


4860 


taatacctca 


gggaagcaag 


attagtaaat 


4920 


agctatttca 


tgaaggccag 


ggtgagagca 


4980 


aaatgcttta 


atttgtaact 


gcatcaacat 


5040 


cacagacata 


gacaattacc 


ttcaaatcca 


5100 


aaatattacc 


tttgccagga 


ctaggaacgt 


5160 


tggcacaata 


tgagtttaaa 


gtatggagaa 


5220 


cagctgtgta 


aacaatgact 


cactcaagaa 


5280 


ttttgtttgt 


ggggattgtt 


ttgtttgcat 


5340 
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atatatttgt gtattttatg tggttttgag gggcatgggg gttctgctca tctctgtgtg 5400 

tctacccagg tctctagtag tcactttagc ccagatagat tctttagcca ttaactaaat 5460 

attggcttcc tagtttttca ctacacatac aataacgtac ggtaatcaga tgtgtgactt 5520 

tgccatgtgt acatacttcc agaactgcca gttcaaaata aacagaccct ccttgacatg 5580 

gagaatccac ataatgtcac aggaatgact cacatacctt ccaggaagat atgtgagtaa 5640 

aagaagctct tctatgctgt ggatagtgag cagagctata aagaattaag gctatagaac 5700 

ctataattta ataagatttt tttttctgag ttttccctgg tggaataaac aaagggaaaa 5760 

15 tgcatttttg tctaccttgg gagggaaagt aacatgattg tgaagtctct tagatttaat 582 0 

gcagctttgt ttgatttcta gatatttcta agagtttgtt aacatttgta ataataggca 5880 

ccagaaactt ctatgggaaa aaagggaaaa cgtgaacaaa aaaaggaact atggcatttt 5940 

cactgaggaa gtaaagcgtg atctggagaa ataaaaaaca gacaaacaca tattggtccc 60 00 

aggccagttt taagaaaccc ttttatacta agcccttttt aacagaaagt gttaatagta 6060 

25 ttcaattttt ttaaaaaaag aaggagaact cataaaattc acttagctct cctaattacg 6120 

gcaaattatt ggaacaaatt gttttgttaa attcaatcag tcttaggtct tagagacgct 6180 

acaaatggga ttatcttatg gacaaatgtg acagaaatgg gatgggaact tgaagaatat 6240 

aatgatttct gaggatcagg aaagtggaat taagtccttc tgatttcgag aaattactgg 6300 

gcaatagtgc agatcagctg ctgttgggag atagaaatca ttgcaatttt aataggctaa 6360 

35 gatcagtggt agtgcaaagg agggcatagg atgaagcaaa ctttatagat ataaaatgca 6420 

aacctataaa atggccagga tgtcatattg gagataaaga cacctttggg agactgtact 64 80 

agagaggatt agcaaggcac ctgttgtggt tacacacaat cccagtgtct tctaaacccc 6540 

agtttaaaag acagttatct gacctctc.tc caagtttatt acagactaaa gaatcacagt 6600 

tgcctgtaaa tatgtctatg tttgaggaat tgaatgacta tcactaagag atgggattcg 6660 

45 aatttattgg agcttccaag agtctttaca agaatggcaa gaatgttgca ggggcccagg 6720 

gccgggcacg gtggctcatg cctgtaatcc agcactttgg gaggctgagg cggacgtatc 6780 

acgaggtcaa gagatcgaga ccattctggc caacatggtg aaaccccgtc tctaccaaaa 6840 

atacaaaaaa ttagctgagc gtagtggcat gcacctgtag tcccagttca ggaggctgga 6900 

gcaggagaat cacttgaact cgggaggcag aggttgcagt gagtcgagga gccaagatca 6960 

55 taccactgca ctccagcctg gcgacagagt gagactccgt ctcaaaaaaa aaaaaaaaaa 7020 

aaaaaaaaag aacattgcag gggccaaggg tatagcttcc cttttgccct ctgaagtttc 7080 

gctgaaaagt caacttgtaa aagtcagatt aattgagaaa aggcatacaa attattcaac 7140 

acatagacat gagggccttc agaatgaaga cccaaaaaat ataggggaaa gtatccattt 7200 
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gtatgcttag gttcaacgaa gtaaggacag 
tgctaatagg cttagttaga accgagcaag 

5 

ctgtgctgca ttccttcatt ttggttgtag 
tgacctacag tcaaacaaga tagctcagat 
10 gagtggaagg agagttagag taacactttt 
tgtctggttt ctatgaccca ccttggggag 
gaagatagga ctgagagaca ggaaaggaag 

15 

gttttggggt attgttttct gagcccctaa 
ttttattgtt cttttcctcc ttttttctct 
20 ctttcgagaa tgaatgtgca aaattcatga 
gccttacaag cattaactgt ctatcatttg 
aggattctca cactagccta gagaacaatt 

25 

gggagagggg cagggaagtg ctaagtagag 
tcggcctttg cccacaaacc caggtgagga 
30 catttccaag ccagccatgc ccctcctcct 
gtatacacac acaagtggct ggatgtcaag 
agttgctggc agaccatcga tggcagaaca 

35 

gtggttggag gagagccagg ctgctgagca 
tccatccccg ttctggctcc ctgtccatct 
40 ccttgcactc actctccaag cccacatatg 
accctgggat acagaaagcc ctctgtcctt 
ttaaagcaag gtgcctgcgg acggctaagc 

45 

tgggcttcgg gagctgtaaa cactcaaccc 
tccccatgac ctgcccgtct gcatgcttcc 
50 aagtgagcca cacccccatc acgcaccctg 
tcaatatgtt aaagcttatt caccccatga 
tgcctcaatt tcagagcctg ttattggtct 

55 

tcttgggagg gtgcatgtgt ccaggaattt 
tgtgtagagg tgtttatagt attttctgat 
60 gatattccct ttatcatttt ttattgcttc 
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ccctgtagaa 


atatgactgg 


atataatcta 


7260 


acccgtctgt 


ctaggttttt 


cttggcctct 


7320 


gacaggacca 


tctctggaat 


gggcatctta 


7380 


gatttcttta 


tgaccagttt 


tacacagata 


7440 


aggttttaag 


gctggctgtg 


gcaagagggg 


7500 


ggattcttgt 


ttgtacagct 


agcctcaaag 


7560 


tcagagaaaa 


acttttgctt 


ctgaggtatt 


7620 


atacataata 


ggtatctttg 


tctttgccct 


7680 


tatgtgccac 


tgtcaccatt 


ctcaggtggg 


7740 


ttagggtcaa 


ccacatctcc 


atagctatgg 


7800 


catcttcaga 


ctgatattca 


gaacagtttc 


7860 


ttttaaataa 


ttccatcaac 


atattgatat 


7920 


aagggcaggg 


tccctggcga 


ggttccaccc 


7980 


caggcactcc 


tgttttcatg 


ccaaatgttg 


8040 


gtgcctataa 


aaaccctgat 


accctagtgg 


8100 


aggaacacac 


tggcagaaga 


acacaccaac 


8160 


aagcagatac 


caaaggaaat 


tcagctgagg 


8220 


gccctactcc 


agagaagacc 


accttcccac 


8280 


gttgataact 


actatcacca 


ctcaataaaa 


8340 


atctgatttt 


tctgggacac 


ttgggcaaga 


8400 


gcgataaggt 


agagggtcta 


actgagctca 


8460 


tgaaagagca 


cattgtaaca 


tacacccatt 


8520 


tagatgctgc 


catggagtcg 


gagcccacgc 


8580 


cttaggggtt 


tgagctgcgg 


ggcaccaaag 


8640 


caagggggat 


aagggaaaac 


tcccctcgtt 


8700 


gagccatttg 


gttggtaggc 


tattaattat 


8760 


attcagggat 


tcaacttctt 


cctggtttag 


8820 


atccatttct 


tctagatttt 


ctagtttatt 


8880 


ggtagtttgt 


atttctgtgg 


gattggtggt 


8940 


tatttgattc 


ttctctcttt 


tcttctttat 


9000 
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tagtcttgct agcagtctat caattttgtt 
attaattttt tgaagagttt tttgtgtctc 
5 tatttcttgc cttctgctag cttttgaatg 
ttgtgatgtt agggtgtcaa ttttagatct 
tataaatttc cctctacaca ctgctttaaa 

10 

tttgttctcg ttggtttcaa agaacatctt 
gtagtcattc aggagcaggt tgttcagttt 
15 cttaatcctg agtaaaatgc tattcttaac 
tgttactagg cttattacta ggactatgta 
tctaggagaa tgaaaacacc atagaggaag 

20 

atccaagcac ctagaacaca gcctggcaca 
gaaaagtgag gaatattttt catgtggaat 
25 ttcagtaagg agattttctg tgaagtacta 
agcagaggca gagaccaaga gaggcagaaa 
agagactccc ccatgcttag tcatcccatc 

30 

tacaaagcta gagatgctca gcctcatccc 
ttcatcatca ctgaatggtg agtcccagca 
35 catttgtttg tatttcactt ggaaggtagg 
caagtccaga tgaccagtaa agcaagttcc 
agtcccagcc aggcacagtg gctcacgcct 

40 

agcagatcac gaggtcagga gtcgagacca 
actaaaaata taaaaaatta aacaggcgtt 
45 ggaggctgag gcaggaggat ggcgtgaccc 
cccgccactg caccccagcc tgagtaacag 
aaaacgatca cagagcacag tccctgaatt 

50 

ttactttcta gggtgtgtga cagggtaagt 
gccaaaagac agaaaccttc tgttatttta 
55 cctgtgtatt aaagaactgc aaaggcataa 
actgcatgaa tcagccacag cccagtcact 
gtttccccac ctgtaaaatg ctgaagttgg 

60 

ggcgtgatac gagattctaa gaaacaggga 
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gatcttttca 


gaaaaccagc 


tcctggattc 


9060 


tatctccttc 


agttctgctc 


tgatcttagt 


9120 


tgtttgctct 


tgcttctcta 


gttcttttaa 


9180 


ttcctgcttt 


ctcttgtggg 


tatttagtgc 


9240 


tgtgtcccag 


agattctggt 


atgttgtgtc 


9300 


tatttctgcc 


ttcatttctt 


tatgtacgca 


9360 


ccatgtagtt 


gagtggtttt 


gagtcagttt 


9420 


atcagttctg 


ttccattggt 


ctattcacct 


9480 


gtatgcttcc 


tcactgttct 


atttgaacat 


9540 


gtggtttttt 


ctgttctgtc 


cattgatgca 


9600 


gaagaagcac 


tcaaactgtg 


gtatttcaat 


9660 


tttttaaagg 


aatgttgtgc 


cctacacctt 


9720 


cgcgaggcct 


gctgtgtgca 


tggcgagatc 


9780 


cctggcagag 


ctgctgagga 


tggtgtagaa 


9840 


atgcccattg 


tcatgtgtag 


cctccagtta 


9900 


catttgagat 


aattgcattc 


ttttgcccta 


9960 


attctgatat 


caagagcacc 


aggaaataac 


10020 


gtctcacacc 


aggtttgggg 


atttactgtt 


10080 


tggaagtaat 


aatatagcag 


ttaagagcac 


10140 


gtaatcccag 


cactttggga 


ggctgaggcg 


10200 


tcctggctaa 


cacagtgaaa 


ccccgtctct 


10260 


gtggcgggcg 


cctgtagtcc 


cggctacttg 


10320 


cgggaggcgg 


agcttgcagt 


gagcagagat 


10380 


agcaagactc 


cgtcaaaaaa 


aaaaaaaaaa 


10440 


aagattacct 


ggattcaaat 


ctagacttca 


10500 


tacttaacct 


ctctgtaatt 


taggttatca 


10560 


gcagagaata 


cttgctagaa 


agagtcatta 


10620 


cty aaaaOflL l. 


UCICIm uao va'- 


aaacataaaa 


10680 


actaactccc 


taagcaatcc 


ctggccttta 


10740 


ggggaaaaat 


cgtttcactt 


tcattccggt 


10800 


gaaaattccg 


tgagtggagc 


caataaccaa 


10860 
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gtcaaacatc 


aggggggtct 


atgaacactg 


ctccaaggaa 


tctctctctg 


gataaccaga 


10920 


5 


ccttgtgccc 


tagccccagc 


ccgtcatctc 


cactgctaga 


gaaaaaaaat 


acagttagtc 


10980 


tgtccaaggc 


tgttagagtc 


ctccaacata 


taagacgcat 


ttccatttta 


ctagcttcat 


11040 




ggtctgcttg 


gggtggggat 


ggcggggaga 


tcgtatcctt 


gaagtaggca 


ctaggactgt 


11100 


10 


gagtcacttg 


cctgtctaga 


gcaccaggac 


tgtttgcaaa 


gggctgagga 


gatttccatc 


11160 




aatggaaaag 


agcatgaatt 


tagagtaaac 


atagattaca 


cttagagatc 


agagatcaat 


11220 


15 


ctagagataa 


aatacacttt 


tctctaaagg 


gggtgggaga 


agatgttgga 


caagaaatac 


11280 


cagattttct 


taatggttca 


tttatgtcag 


gctctatggt 


ggggactttt 


atgttatttg 


11340 




acttaatcct 


cacaacaatt 


ctgaaaaatc 


ttttttcaaa 


ttttatgaga 


aagtgaactg 


11400 


20 


gatatttaac 


tgattttctc 


acgaattcct 


tctgaggctc 


agaagtgtga 


aagaagccca 


11460 




aatgaccaag 


gttctttaac 


aatttggtca 


aaacaggtca 


tagacccaga 


ggaaggtctg 


11520 


25 


gatcctggaa 


gttttcttaa 


gcaaagacag 


tgcttgttgt 


ggatatgaag 


atgttgacta 


11580 


agctgggcac 


agtggctcac 


acctgtaatc 


ccagctttgg 


gaggccaagg 


caggtggatc 


11640 




acctgagatc 


aggagttcna 


gaccagcctg 


accaacatgg 


gtgaatcctg 


tctctagtna 


11700 


30 


aaatacaaaa 


aatagacagg 


catggtggtg 


ggcacccata 


gtcccagcta 


ctggggaggc 


11760 




tgaggcagga 


gaatcacttg 


aacctgggag 


gtggaggtta 


cagtgagctg 


agactgcacc 


11820 


35 


attgcactcc 


agcctgagca 


acaagagcaa 


aaaactccat 


ctcaaaaaaa 


aaaaaagatg 


11880 


ttgaccaaat 


ggcagactct 


tgccatgaag 


caagcccaga 


cctcagtgcc 


aagagatgtg 


11940 




aattagattc 


ctgcctcttc 


cacagactag 


acatgtgact 


gggggcagat 


tccttcagca 


12000 


40 


atatcacctc 


atttggagaa 


gaactggact 


aaaataccct 


ccaagtagtt 


tatgagttat 


12060 




ttaaatctcc 


caggcttagg 


tttgctcctc 


tcttagggga 


tagccttgcc 


acccttcagg 


12120 


45 


aaatggagac 


tgtatatgga 


gcatgttgtg 


ccgccagctg 


ctgcatgtcc 


cagagccatc 


12180 


ttaaacccta 


cccttctgaa 


caatggctag 


tctacaagaa 


gccttaaaag 


ggatgtgaaa 


12240 




gtggcaacag 


tgtcccctca 


acaacaaata 


cttccccaaa 


tatgagcttg 


tgtggagcct 


12300 


50 


gatgactctg 


cctctgaggc 


caatccactg 


gcatctttcc 


caagatgtgc 


ctggacttcc 


12360 




ctgagcatga 


ggccaaagat 


ggagaatgga 


aacacccaaa 


agaagtaaca 


aggagacacg 


12420 


55 


gaattttcca 


caggttgctt 


tcagcttacc 


aagagcctgg 


aaagtgagag 


aatagattct 


12480 


ccaaaattca 


aaactaacat 


tactattatc 


aatatttatt 


gtctactttt 


tgtaaactct 


12540 




ttgcatcttt 


tttctttaat 


tctcccaaca 


cccacagagt 


ggaattatta 


ttatccatcc 


12600 


60 


accatttatt 


taatagacaa 


gaaaactcag 


gctaagcaaa 


actagatgac 


ttaaataatt 


12660 
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ctatgtagga gtagggaaaa tgaattttaa 
aaggtctttt tctcatgtgt cacttctatc 
5 gaaagaggaa gcagaggcac tttttatata 
atcctgtgta atcataaata ctgctctaag 
gcagagcacc aagattgttc tggcaaggaa 

10 

cccagaggtg agctgatccc cagccacagc 
caccttggtt tagatcatga cttttgggga 
15 gaattggatt atcatcctgt ctctgattct 
agtccccttt ttgggtctca gtttactcac 
tcttaggttt ccctcattga ggtgattttc 

20 

ctcagtctta cagagtctac tctcaacaaa 
gaaccccatg gtcctactct aattctgtaa 
25 ctccatgtca tgatttatct tcaaagcacg 
ccctccagtc tatctcctca aggcgaatgt 
tgtttggaga tgatagcatc agtctggaga 

30 

ataggattcc ttctaaaagt agcaacgtgt 
taataattcc acttccatac atcactgtga 
35 tgcaaagatg ttcattattg cattgttcat 
aaatagtcat gaacatgtga ttgattaaat 
accatttggc tttttggttg tagtaaaata 

40 

gtgttttttt gttgtttttt gtttgtttgt 
gctcagtcac ccaggctaga gtgcgtgatc 
45 tcaagcgatt ctcccatctc agcctcctga 
gcctggctaa tttttgtatt tttttagtag 
gtcttgaatt cctgacctca ggtgatctgc 

50 

caggcatgag ccgctgcacc cagccttaac 
agcacattca caccattgtg caaccatcac 
55 cccaaactgg aactccgtgc ccattaaaca 
cctggcaacc accattctac tttcagtgaa 
gaatcataca atatttgtcc ttttgtgact 

60 

ggttcatcca tgttgtgcca tgtatcagaa 
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ctgggaccgt 


acaaatcact 


agtctgcaca 


12720 


ttgtgtcatt 


ttgccagacc 


tcagaattta 


12780 


tcaactgtct 


gtaactacag 


caaaattcac 


12840 


aaagggacag 


gaagtctcag 


aggctggaga 


12900 


cagccagtgg 


gaggttccag 


ctgagcgctc 


12960 


acacaggacc 


aggctgcgag 


aacaggtaga 


13020 


agattgtgtg 


cttttgaaaa 


gacagccagg 


13080 


gaccacctgt 


gtgagttgga 


gcgtcagctc 


13140 


ttgtcaaata 


agaaaattgg 


gtaaaaccat 


13200 


ctcaatgttc 


tgcattctcc 


tggtgtgtca 


13260 


tactgtgcaa 


ttagactgaa 


tcttgcatta 


13320 


tacagaaaat 


tgctgggcag 


agctgttgac 


13380 


gattttagag 


ctgtcttgaa 


agctgtgggt 


13440 


ggaaggacaa 


agcatcctct 


cagactgtgg 


13500 


cttttgtgaa 


acataaagaa 


tcttggttat 


13560 


atctaaattc 


aagtgtcata 


gatttaaatg 


13620 


gataattaaa 


tgcaggaggt 


atcacaaaag 


13680 


agtggcaaaa 


agaaaaaata 


gaaaccacct 


13740 


gaatctagag 


acctccattt 


atttaaatat 


13800 


tgcttaaaat 


aattgtagcc 


ttttaaccat 


13860 


ttgtttgttt 


tttaattgag 


aaagagtttt 


13920 


tcggcttact 


gcaaccactg 


tcttccaggt 


13980 


gtagctggga 


ttacaggcac 


tcaccatcat 


14040 


agacggggtt 


tcaccatgtt 


ggccacgctg 


14100 


ccgcctcggc 


ctcccaaagt 


gctaggatta 


14160 


catttttagg 


tgtacagttc 


agtggcatta 


14220 


caccacccaa 


ctccagaact 


ttcttgtctt 


14280 


a. u. Qy ^ ^- \— \— ^ 


attcccctct 
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14340 


tttgactctt 


ctaggtgcct 


cacataagta 


14400 


ggattatttc 


acttgccata 


ttgtcttcaa 


14460 


tctatcattt 


gcttttaaat 


ttaggctttt 


14520 
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tattatttct attcatcggt caaagatgtc 
gttataaaat ggcatggata gaatgactat 

5 

atataaagag gagagagata aagtgacaga 
taggaaacaa gagtgatcgg gtggcagttg 
10 caaactgtat tcccttgtaa tggtggtact 
taagaaataa caaaggaaaa taaatgttca 
tcaattaaac tttcgagtcc tagcttcagg 

15 

ggtatgagat tcgaaaaata aagaaaggaa 
tagtaggaag gagtctagtt ggtatatttg 
20 gccaagtttc tacaacatga aagcaagttg 
aagatcagca atgtccctgg gagaggatat 
caatagggga ggaagaatag agaggatgag 

25 

acctacaggg atatagaaga gctctggtgt 
ctacctgcta gataagcaat cttgcacaag 
30 tatcttcaaa gttcagcacc catctctgag 
tcttaataga atgccacatt catagtacgt 
ttatcatgaa ctactgtttg attctggaac 

35 

tccttctctg gctttaaaga tgtgctggaa 
gcttcaacat tattcccctc ataggacatg 
40 aaaattccaa gaaatgaaaa ctatctcctt 
ttcagataaa gtatctcatg cccggagagc 
gagtagaaat aaaatggagg tccaaacatg 

45 

tcagagagtt acaacaaaag acactcacgt 
aggatagtgt cattatcctt attttacaga 
50 atcctgccta cggccagctc atcaatatca 
actgatatgt tttgggagtt tatcacatgc 
catcttatcc aacctgtacc acaaacctat 

55 

gtttagcagt aactggtgga tatgggatct 
tcagatgtgc aactagatgc ctaggcattg 
60 cctcactagg aagagattct aggtgtcatt 
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tacagaataa 


tgttgagtac 


aaaaagaaca 


14580 


ttccgtagaa 


ttgtgaatgt 


ctgtgaacat 


14640 


tgggaagaga 


gaatgtaaag 


aatatgtgca 


14700 


aaagttaagg 


agctaatttg 


ttttcttctt 


14760 


ttatgcactc 


tttcaaacag 


aacaaacagt 


14820 


tggtatcaga 


taggctatag 


cctgaaagta 


14880 


ctgagaatca 


agcattttct 


gcagtcctta 


14940 


cccagaactc 


aggaggctgg 


tgacgaagac 


15000 


ggctattgag 


ccacacagag 


caaggtaaaa 


15060 


ggaaaaagaa 


atcttcaacc 


tatattccct 


15120 


ggaactgatg 


agtggcagcc 


aatgcttcaa 


15180 


aatcagttgt 


agctagccgg 


gcaagaacaa 


15240 


tagatggcct 


gtgcccaaat 


cttgcctcca 


15300 


ctatttcacc 


tctccaagcc 


tcatttttct 


15360 


actgcagatt 


aaatgagact 


gtcacaaagc 


15420 


gcccaatcaa 


tgttagcaat 


gattatcatt 


15480 


ctagaatttc 


tctaagcctg 


acactttttc 


15540 


ggctcaccct 


gactccactg 


cagggattca 


15600 


taacctcacc 


cacacttcta 


cccagatttc 


15660 


aaacaaagag 


aactttaggt 


ttaataattc 


15720 


atgagaaact 


agactaaggc 


acagatctag 


15780 


gatcgactat 


tatggaacac 


ctgataaaac 


15840 


gtgtcattca 


aatcctcaca 


agagtccttc 


15900 


taaggaaact 


ggggatgaat 


gagggaaagt 


15960 


gaactgggct 


tccaatgaat 


attagataaa 


16020 


ctccccctat 


gctgagtcca 


ttatatgtgt 


16080 


gaagtggtga 


aaataccttg 


cccagggtga 


16140 


gaacacacga 


ctggactcca 


agacatatgg 


16200 


gactcccatc 


ccagtgcact 


tttccatctc 


16260 


tccaattttt 


acatgttctt 


cactccaccc 


16320 
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aagagaggcc agaaccattc ctaaagcacc 
agctgtggca cctttcaagg atgacaagcc 
5 aatgttgagc tgaaaacaaa tctgggcaac 
cacagaggcc cagcaatgct gctgagttag 
caagggaact ggaaatgtta ggaaggaaga 

10 

acatggcata gagaaaaaaa tatcacaagc 
tgtaaacaca ggctgttaat atttgctacc 
15 aagtacttta tgtgactatt tcattcaata 
tctaatgtat aaaattgatt taataacatg 
ctgcaatggt ctaatgcttg gaattcagca 

20 

acgaagccaa tataagcaag gaggaagatt 
aaggaaagtg gaacagcatt catcctcaac 
25 tagaagactg atcagtgcag gtgtagcata 
aaggtagcac aagtgaagca gagatacagg 
cactggaggg agagggagat ggacacattc 

30 

ctcacctctc aattgtggga cccctcaaaa 
cttgaccaca atggatgaac tgtgcccagc 
35 catgttttgt ttctgaagga caaccagcct 
ctacaaagta tggtccctgg gaagtttatg 
gttttgatgc ctcttccagc atcatcagca 

40 

tttctcacag ctttacacca aagggcatca 
tgtaccaaaa cgaagattac ctgcagaacg 
45 taagtccacc tcattataag gggaatactg 
cagatctaca gtgggtctgg gaaactcttt 
aggacatgaa atggagcttt aagggaagac 

50 

taaaaccact agatcttttt cactagcatg 
tttatttatt tatatattta tttattttga 
55 agtgcagtgg cacgatctca gctcactgca 
ctgcctcagc ctcccgagta gctgggacta 
ttgtattctt agtagagatg gggtttcacc 

60 

ccttgtgatc tgcccgcctt ggcctcccaa 



PCT/US01/48437 



aacacattga 


ggcaaaatgt 


caggcagcag 


16380 


tttaactaaa 


ctctggcctt 


cagtagggag 


16440 


atttcaggtg 


ctgagccttg 


ttgctgacac 


16500 


tggtggcaga 


caacttgctc 


ctggaatcta 


16560 


aaggaaggaa 


gaggaagagg 


aggagaatta 


16620 


gtctgagcca 


gacaagtctc 


aatccagtca 


16680 


atatattgaa 


cacattctaa 


gtgccaggct 


16740 


aaagtcactt 


gagcttatta 


gccccaggtt 


16800 


taagttacag 


ggttgttata 


aggattagag 


16860 


ggagttaagt 


aaatgtttac 


attactaatt 


16920 


ccggatggga 


caggccatct 


cgctatagga 


16980 


atttttacga 


agacaaaatg 


aagactggag 


17040 


aaagtgtaat 


cctggaagat 


gtggtgtgag 


17100 


agatagggaa 


gggaagctgg 


aagcagaggt 


17160 


agggctacaa 


agcaagttct 


atgtgatttg 


17220 


tgtgtacagt 


actctcccag 


tgacatgctt 


17280 


atgcccactt 


tccaatgctc 


cactgatccc 


17340 


tggaataatg 


gcaaatacct 


tcttaagttc 


17400 


tctgtaagtc 


aaaccttggg 


aagtaactga 


17460 


tcatgctatt 


acaatcccaa 


accatggggg 


17520 


ctatccctca 


aagagagaaa 


cctggacaca 


17580 


ggctgccaac 


agaaaccacc 


gttcttgggg 


17640 


agaaaatact 


ttgtaaatgt 


atctagacta 


17700 


cactcttttc 


tggctgactc 


caggctaatg 


17760 


tcttaaagct 


ctgcagcagt 


ttctagaaac 


17820 


cttcacaatt 


aacaggttag 


tctttcttat 


17880 


gacagaatct 


ccctctgtca 


cccagcatgg 


17940 


arrt ccaccfc. 
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cctaaattca 


aatcattctt 


18000 


tgggtgcacg 


ccaccacgcc 


cggctaattt 


18060 


atactggcca 


ggctggtctt 


gaactcctga 


18120 


agtgctggga 


ttacaggcat 


gagccactgt 


18180 
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gcccagccac aaattagtct ttcttatttt 
accatatgga ttttactcaa tatgattaaa 

5 

gagatgagac gatcaagact aatctcatga 
taagacatcc aagtgtccca gatcatcatc 
10 gttttgaata ccctcttata atacccaggt 
ctgccctgtt tctttacaat ctattcttag 
aaagttagat catgtattct cctggctcaa 

15 

aaatgaaaag caaagttggt aaaacatagc 
ctccagtact tcacaagcct catgtcctac 
20 atggaacttg tctgtacctt tacagacaag 
agtgattttc atttgcttca ttccctcgtg 
caataactgt tgatgatgtg aatggttgac 

25 

atctttcaat tgcacatgca ttaaactatg 
cagctaaact tcagttcttc cacttgcgaa 
30 aaataagatg aattacagca gaagagatga 
tcttcattaa cactcctgtg aaaagtttgc 
ttatctactt ttcaagaaaa taaatttaaa 

35 

aaaatgtgtg tatgatcact ttccgatttg 
ggttagctat acatgacttt ctctagaagc 
40 tctttaatat aacaaattat aatgtatttt 
aataatcact tttatctgtt tgaagtgcag 
tgtccagatc ctgtgttgcc tgttgatttc 

45 

ctacccctcc cacttcaatc cagcaatttc 
aggagctctg tgtgtgagta gaatggggac 
50 accaggtgca tagtcgtgga gtgactcaga 
aggcctcgac tttccttcta tcattgcccc 
gcagagagac ttaatctgtc ttcctgcaga 

55 

gggtatacca gcacagaatt tcttacctgg 
attttccata tccctctttg aggctttgat 
60 taatcaactt ttcaggtaga gagtgagaag 



gtcattagta cagtaaaaat tttctattta 18240 

ttaactttga gcttattttt caaatatttt 18300 

cccatttttc cccgttcata tgagtaacta 18360 

cctcatatcc agaactttaa tctccatgtg 1842 0 

cctattcact gcctatttag atggatctcc 18480 

tacagcaggt agggtactct tttaaaaatg 18540 

aattatccag tgggattcct tttttgctca 18600 

ctaaaaatcc ctacatagtt tgtaccatgc 18660 

tactttgctc ttaactctat tctgccttgc 18720 

acatagtaca catataagct ctccaatggt 18780 

cctggcacac aacatacctg ttgtggtatt 18840 

ttcttttttt atttctccta tgtttgcatc 18900 

agcaaaagca tgggctttag agcaagctaa 18960 

ataattatta ttaaaatcca atgatgaaca 19020 

cctgctctat ctaatttcaa ctgaaaaatc 19080 

tctctatttc cacctcactg gacaactttt 19140 

aactgtcaag tatctttttc aaaaaaagtt 19200 

ctggtttaat aatcctatca gaaaagaaga 19260 

atatcgattg gctccttgta acatttttcc 19320 

atactcttta tactttaccg tgtgcattca 19380 

tctagatttc ctgtcttctc tcttgtagac 19440 

aagtctgggg gccatcttgg tttttgctcc 19500 

caccactttg atgtctgggt acccattttt 19560 

tctaagaggg gacatgcttt ataaatgcta 19620 

ctccaagagg ccaggaggca aaaggcggca 19680 

acgtcacatt tcccctattc aagaagagca 19740 

gcagggggca agagggaatg gtaatacaat 19800 

cattacagca tacctctaga aaagtggatg 19860 

gaggccattg gattccctcc attgtaagaa 19920 
gccattttta aacaataaaa cgagcaccac 19980 
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tggttgatgc cacaggtttg cagagatttg 
gggacatgct tgctgtgtgt ccagaggtaa 
5 ccttccacct aatggctcat gaaaaagggc 
gggcagactc acacttccca caaagggtta 
gataaaaatc aaatattctg ggcaaatgaa 

10 

cagtccctgt gtgaacacaa actgtgcagt 
gcacaatttg tccttgactt ctcctgtaac 
15 gaagctagcg ttagctcata ctggaccctc 
cacacttaag caacttttca gcttctcaga 
taccactgag attaaatact gatttctggg 

20 

tgctcttccc actttcctcg actcccattc 
agtccatgtg tttctgcacc ttgtcccatc 
25 tgttccaatc tctctgtcca ttcaagtcat 
ctgtaattta gtaacactct tctccttccc 
tcctagtgat ttccactaaa acaaatccaa 

30 

tcaatataga cattcttata tcaggtggga 
agatttctgc agtgttctct ccctaaaagt 
35 gacagtgaca tggtgggcac atttccacaa 
tgggcagttt ggcattactg gatccctctc 
ctttgtaagt ataaggacca tcaaatctca 

40 

aattcttcta gtttaaaaat ccctatttta 
taggaagccc aggttttcct ggctgtccca 
45 cttgccttct agaccagtgc tctttcgtct 
gcttggagga ggtttacctc agttatttaa 
tctttatagc ttgtaacctt gaccttgaat 

50 

cagatccttt tttctataaa acctgcctct 
ttcatagagt tgggacttca cacaaattct 
55 tggttttccc aggactgtct cagttttagc 
actccccagg caaagcagaa taactggttt 
aaagaagtcc atgatatatt ttcaaagttc 

60 

tcagtcaaat atcaggtggg gaggcattca 
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gattaacaag 


ggtaagattc 


atgggaggca 


20040 


ccctgagcac 


tagagaaact 


ctctttgctg 


20100 


ccagtagatc 


catttttcag 


agagtggcca 


20160 


tttcttctcc 


acgtagtatt 


tggtacaaag 


20220 


ctatactaag 


gtaccacttc 


tgtgtggata 


20280 


cacataagga 


ggccctggtc 


ctgccccaga 


20340 


cagtggttgt 


gggccccaca 


tctccttaga 


20400 


tttgcctgtg 


attattcaat 


atccaagcca 


20460 


atacccttcc 


tctgccagct 


ggatctttaa 


20520 


gctttatgat 


aaagacttca 


cgtctgagcc 


20580 


tgactcaagc 


aacatacacc 


aataacttcc 


20640 


agagcctctt 


tcctgagtgc 


ttctgttggc 


20700 


tggtccatct 


caacctcctc 


tctttcaacc 


20760 


aggtacacct 


tttctccagg 


cctcctcagt 


20820 


cccctctgtg 


catctgcatg 


tcattcaaag 


20880 


attaggccta 


tctagcaata 


tagatccaca 


20940 


tttccaacat 


cactttttga 


ttgagtgatt 


21000 


gacaactgaa 


ctgatcattt 


ctctctcttc 


21060 


aattatctct 


ggaaaacaat 


caactaagcc 


21120 


gctggttgga 


attgaaggaa 


cgtcagagtt 


21180 


caatttagag 


agaaaaagct 


tagaacaggc 


21240 


ttggtagaac 


caagagtaga 


gcccaaattt 


21300 


acacaatgtt 


gctcctggat 


tatattccaa 


21360 


aaagtgtaca 


tttttaaatg 


taccatggac 


21420 


aaggtagcca 


cccatcccag 


agagagccta 


21480 


tggggttctt 


tttgatgcca 


aatgtcccca 


21540 


agcccttcag 


cgtcagggtg 


accaattgtt 


21600 


f fana a o /~t /*■» 
CCdgaadgCC 
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accctactct 


gtgaccgtgg 


taacaagttg 


21720 


tactatttaa 


gcaactgaaa 


taaacttctt 


21780 


ttcccctgtc 


agaagagggt 


gttgccgtct 


21840 
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tcatccaggc tggaggagac tatttccatc 

gagaaaaaaa atcagtttaa agtaaggtgg 

5 

gggggatgtc tgcaatgtca gcactgattg 

tgccactgcc aacatacccc tggcttcctc 

10 gtgcatctag aatatgacct ttggtccctc 

aaaagagcgg cttcagagaa aagacaaaag 

aggtggatca ggcctatgag aaccacattc 

15 

ttttaggaga caggattctc ccatagcatg 

aggctctgtg cgaggtttcc agcacatgct 

20 tgtatgttgg gaaggatggt tgtcctgccc 

ttcatgtgct gagtctttga aggaaaagta 

taccatatga gcaatggagc ccacatgttg 

25 

ttgtcaagtc caagcataga tttccctttc 

ctgagatcag aagattcatc ttccacattt 

30 gaaggacttg ctcaagcaca cagatggatc 

ataggaggag aggatttcag tttctaagtc 

tgaactcttc aaaaccctct tccataatcc 

35 

gagcatatac cagcctcatg caccatccat 

tatcttcttg agtcaggatg tggcttttcc 

40 aagagagaaa ggctgctacg gagagaagca 

cctccaaggt ttgtgggaaa agtggctaca 

ggagagagaa gcacagggtc taagcccatg 

45 

aaatcgaagg catatgcttg tcatattgaa 

gcttgtcaat tgaaagccta ttttatgttc 

50 ttccagtttg cctccacctt ttctctgaag 

ttcagtttac cagaggggac cgaagtccat 

ttctaaggct attgaggaag tcatgggaga 

55 

tatcttgagt gctatagaga gtcaatgggc 

catgcatttt cttattcaat aagcatttac 

60 acaagagata caaagatgaa taaaacacag 
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caaagactca 


tgcttgataa 


ttggatgttt 


21900 


gagaaaatga 


gggtgatgga 


gtaggggtga 


21960 


cagcaccaaa 


ctcaaagaca 


ataccactca 


22020 


aacaacttta 


gaattattat 


attgccctaa 


22080 


cctagaaacc 


caatgtatgt 


cttcttactt 


22140 


caaactgaag 


aagttggaat 


acctacctac 


22200 


accacaccat 


ttcccaactc 


ctcttctaca 


22260 


acgattcact 


agcaaagaca 


ctgatgtgcc 


22320 


cctctctgtt 


taagcatcac 


tggaagacaa 


22380 


ttttctactc 


cagctctgta 


gctatatgga 


22440 


gtcccagtgg 


gcagtatctt 


tgcccaaagc 


22500 


ttctttgtcc 


cacttgtgga 


gcccacattg 


22560 


tgtgtattca 


gtgcatttgg 


gaagccactt 


22620 


tataaaagag 


ataactgaag 


cccagagtaa 


22680 


agtgacagat 


gggtcagtga 


cagagctggg 


22740 


cagaactatg 


actccaacca 


atgaatgctg 


22800 


tgggttctag 


actgcagctt 


caatcatcag 


22860 


gtgtcctcat 


ctgtgcctag 


ccaggccctg 


22920 


ctaaatcacc 


tctatgggtg 


tgtggcaggg 


22980 


aaggctttat 


agctgcttcc 


tccctttctc 


23040 


gtctgataat 


atttctgttt 


gtgaatgggt 


23100 


tgcctctgcc 


tcatgccctg 


tcacttagga 


23160 


caaatattga 


ttgtcaattc 


aagatcaatt 


23220 


ccacagacct 


atagataggt 


tttccaactt 


23280 


ctgctctttt 


atgtaaatca 


aagtcatgga 


23340 


attctagaag 


ccctgaaggc 


aacagaatga 


23400 


tgttggcagt 


gacctgagca 


gcactgtctc 


23460 


cagaacccct 


ctccctaagg 


aggaattatc 


23520 


taattatatc 


acatgattaa 


ctaaatagcg 


23580 


tttgtgccca 


cacaaggaaa 


caggaaggaa 


23640 
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acagacccaa ggaagcacca ttgcagtaca 
gattctgtga tgagaaccca atgcagagtt 
5 aaatgcagtg agttctgtta ctgcaggagc 
agccctgagg actgcctctc aacattgtgg 
ttattcggag tactattatc caatatatga 

10 

tttaacagtg agtagtttca gactgaatat 
ttgcatactc tctgcttttc tggcagtctc 
15 acagaaataa acaaaaactt gtggttggaa 
ggatgtacta tagaaagtca aaagaggaaa 
ccactggtct ttcttaatca caaacgttcc 

20 

acctttgttg tactcatcac aggcagaaat 
tcttcatttg aattatattt taatttggga 
25 gcccctaccc cttaaagatg cttgtcttct 
ctgtacggtg tttaagttaa tcttcaaatg 
aaaacttgaa ccttaagtcc aaatacatag 

30 

cagagagtgg tgggaaggga gacaacggat 
gggcagcatt ttcttggcac ctttctccca 
35 caagccttgc ttattcaatc ctataaatat 
ggccttttaa gaagaaagac ttcttcctct 
ccaaggtagt atctctcatt cccttggatc 

40 

gcaagctttt gcactcctgt gggtctgctc 
ctttggttca ggagcttgag gaagtggaag 
45 tagatagtgt catcctggtg taggacagac 
atattggggg tttaatttaa agaaaatgaa 
agacaactgg agtccaaaag aacttcctac 

50 

aagcagactc caaaggccta ctctttgtta 
atgaaatttt aaaaattaaa aaatatatag 
55 cagaaggaaa aacaaatcta agaagaaaga 
ctaaagccaa atcatcaaac aagagctgag 
tattggtatt agaggctagg atcacacagt 

60 

acagggccat aaagctcata tggagtaccc 
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atggacagag 


gactaatagt 


gtcacgtgct 


23700 


tctcttccag 


gacctgagca 


gcttgacctc 


23760 


aggcctcttc 


ctccttgctg 


acagcatggt 


23820 


ctcagaaatg 


gattatctat 


cctcattgcc 


23880 


aatcaaagat 


tgtctcctga 


ccagtgtcag 


23940 


cctgtcatgt 


gaaatctctg 


tgtctcccct 


24000 


tggttggtcg 


gtccgaatgc 


ctggagacag 


24060 


gaggcctgtc 


tgctgactgc 


atggtgtggg 


24120 


tcatgaagga 


cagtagggag 


tctatcccct 


24180 


aaccttaatg 


ggtttaactc 


ttaaagtaca 


24240 


attcttccta 


ctcatcacag 


gcagaaatat 


24300 


ttcttttggg 


ttcttttcca 


ggggttggat 


24360 


gatcatcagt 


gaattgagga 


ctagcagagg 


24420 


caagcctgta 


tgctacatta 


tgttaggcac 


24480 


ttattttgta 


ctttggtcta 


acagggctga 


24540 


ttgacttcta 


gaaaaaatgt 


caacctctct 


24600 


aacttgatga 


tcactacctg 


ttcgtggtga 


24660 


agtatcatgg 


tgttaaaacc 


ataaaccagt 


24720 


atgaatcctc 


ctagtaggag 


cacagggaac 


24780 


catctatctc 


acaaatctca 


gaaaatgtgt 


24840 


atgtgaccca 


gagattgtca 


aaagcaaccc 


24900 


agcacagaga 


cctgtctgga 


ggaaacccag 


24960 


atgtggtaaa 


gcaatggggg 


tgggagaggc 


25020 


tttcattacc 


ccaaactgct 


tcccattctg 


25080 


aatatgagag 


aagttgcaac 


atagcagact 


25140 


attaacacag 


caattacctg 


aacaaaagta 


25200 


ccaaagttgg 


aagaagaaag 


gaagggaggg 


25260 


tgacttaggt 


tccagaagct 


aagaaggatc 


25320 


gccaaaatac 


caaggaagtc 


tgaataggaa 


25380 


tttaacatcc 


atgacaagat 


ttggcattgg 


25440 


tgcaaaaatg 


ttgaaagctg 


atactgaata 


25500 
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tttactcaat ttgtgaaaat aatataagga 
agaaaacctt ccgtcagctc actaagcttt 

5 

catataaaat agggggacct caaataaaag 
gaaacttcta aggttctagc agaatcaaat 
10 aacccaactt ataagatgcc tccctatcaa 
tgaagttatc taataaccaa aatattatga 
agcggcagca aatggggaaa catgaaagtc 

15 

atttgaacaa ggctattatg aaaatacatt 
agaaaccagg ccaggtgcag tggctcacaa 
20 cgggtggatc acccgaggcc aggagtttga 
tctctactaa aaatacaaaa attagtcagg 
cttgggaggc tgaggcaggg gaatcacttg 

25 

agatcacacc attgcactcc agactgggtg 
aaaaagtaat agaaaccaaa atggaaaaaa 
30 atgtgcattt gagaaaaaaa tgaaatcaca 
aaaaactgag cagatcagtt aaacagcaga 
atggaattta gatttgatta aatcacacag 

35 

atggatgaaa gcctaaaagg tatgatggat 
tagtaggagt cccagtgaga gattataaaa 
40 aaatttttta gaatttaata aaaacatgag 
ctgcaggata cacagaatca aatacacacc 
atagtcaaaa agaaaataat aaaagcaacc 

45 

cattaggtgg gataccaaag ttctgagatg 
ttaagaaatc tgaaaaagaa agcagaaaag 
50 ctctgcattg ctggagcctc atcgccctgt 
gtcttgggta ccagccattc atgtcctgat 
ttcactgtgc tggagctctt attagctgca 

55 

tactccaaca accctggggt gagtatgctg 
gtccaggcta caataatcca acctcaaaaa 
60 tgaggtgcat gtggaaggcc acttttataa 
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aaattctgcc 


caacatccca 


agaagtttca 


25560 


gtgcaagtgt 


gccatctgaa 


ttcataatta 


25620 


cattaagtta 


aaaactgtta 


cgagatatgt 


25680 


ctaaaactac 


tatctaagga 


gatgtccgta 


25740 


agaaccctac 


cactaccacc 


atctacaaac 


25800 


gaaacataac 


aacagcacaa 


aaaaatgagt 


25860 


agaagctcaa 


gaattggaga 


taacataata 


25920 


ttaaatcatt 


aaagatgttt 


aaaaagtaat 


25980 


ctgtaattcc 


agcacattgg 


gaggctgagg 


26040 


gaccagcctg 


gccaacatgg 


tgaaatgctg 


26100 


gagggtggta 


catgcctgta 


atcctagcta 


26160 


aactcgggag 


gcagaggttg 


cagtgagctg 


26220 


acagaatgag 


accctgtcta 


aaaaaaaaaa 


26280 


aaaaaaagga 


cagtaaagga 


aaaaaaagat 


26340 


tacttggaaa 


tgaaaattgt 


ctttgaaatt 


26400 


ttcaatatat 


tcaaggaaac 


aaagagtgaa 


26460 


aacacaacac 


attatgtgtg 


tttggaaaag 


26520 


gcagggaagt 


ggagaggttc 


agtacatatc 


26580 


gagagagtat 


atgcaaaagt 


aaataactag 


26640 


atttcagatt 


gaaggaatgc 


attgagacca 


26700 


tagaaatatc 


atcatgacaa 


taaaaacacc 


26760 


agagataagt 


ggcagattac 


tctcatctat 


26820 


atgccgttgc 


tgcttatttc 


ccttcatcta 


26880 


aggattaagc 


aattttccct 


gtctcctttt 


26940 


ctcctcttag 


ccaggcacaa 


gggctagctt 


27000 


ttgcagggtg 


tcctagtggt 


gatgctcatc 


27060 


tacagttctg 


tcttttggtg 


gaaacagctc 


27120 


acatgtcgcc 


tgatacctgc 


tgtgtctcag 


27180 


gtggcaaaaa 


gaagaatcaa 


ttattgttca 


27240 


ttaaaaaaat 


gagtttaaca 


gtgaaacccc 


27300 
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atctctacta aaaatatgaa aaactagcca 
tagttgggag gctgaggcag gagaattgct 
5 tgagatcaca ccactgcact gcagcctggg 
aaaagagttt gtataaatgg gctccttctg 
gtgtctactg ggaagcagac cagttagaga 

10 

aaatcacata gcccaacccc tcatgaaaga 
tgaatgacac agactattgg gctgagaatg 
15 acacagctac ctgcctcccg agagagtctc 
gactttacta gaatgtctct gtcctgttta 
tagctcttga tgttataatt ttaaaatatt 

20 

agaaatgttt atagaatttg gatgaattat 
aaataattag cagactggag aattgcatta 
25 tatggcaaag ttggaagaac tagtactaca 
tcttttttaa tctccgaaat caaagtttaa 
ttcattttcc tcgacccagt cacaagatca 

30 

gtcttggata taagtaactc ttggcctcag 
caagtgtgat tagactttcc tgaaatctct 
35 aaaaaaaaag cttttgtttt gtatttgttt 
aataatttcc tcaaagccca agtcaataaa 
aaactttata aactgctttg ggtaaactga 

40 

gcactcatgc tagtgtgaat ttggtaagtc 
tttcacactc atattgctta aatattacat 
45 aattaaagac tatatataat taaactactc 
actgggccct ttttcatgag ccattaggtg 
taggattacc tctcaacaca cagcaagcat 

50 

aatccaaatc tctgcaatcc tgtggcacca 
tggccttctt ggctgggcgc ggtggctcac 
55 ggtggacgga tcacccgagg tcaggagttt 
tgtctctact aaaaatacaa aaaattagcc 
ctacttggga ggctgaggca gaagaatcac 

60 

cggagactgc accactgcat tccagcccgg 
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ggtgcagtgg cacacgcctg tagtcccagc 27360 

tgaacccagg aagtggaggt tgcagtgagc 27420 

tgacagagtg agactcagtc tcaaaaaaaa 27480 

gaggacactc tggtcatctg ggatcagctg 2754 0 

attgcttaat atgaagctag ttggttttag 27600 

accacggctt ggataaaatt acatagaaaa 27660 

agaacgcgta tcccctcact tttatcggga 27720 

cctagccaag tttccttgtt aatgtgatga 27780 

tgcttccctg tgctcttcct tacatcccct 27840 

ttcatttccc ctaaaggcta caacaacaaa 27900 

gatttaacta tgatcctatt taaaatagta 27960 

tttttaatta atattgctgc aattacatat 28020 

caaaacttgg ggaaaaggta ttatgagtta 28080 

aaattctaac caattatgta ttttctagag 28140 

tatccaacag gtcaaaaaga gttcttcacg 28200 

aggaaggaaa agcaactcaa cactcatggt 2 82 60 

gccattttag atactgtgaa acaaactaaa 2 8320 

actatgagtc gttatttaat ttctcttgaa 28380 

tgttatcagc cagtcttcca aaatggtcat 2 8440 

gcagaaggtg atacacagaa gggaaaatgt 28500 

gcgtgactct gcaggctgtt tctgtattat 28560 

attagggatt gtaagaaaac tttaattaaa 28620 

tgccctggac actctctgag aaacagatct 28680 

gaagacagca aagagatctt ctcaagtgct 28740 

gtccctgcct cccatgaatc ttaccatgta 28800 

accagtggtg ctcattctat tgtttaaaaa 28860 

tcctgtaatc ccagcacttt ggtaggccga 28920 

gagaccagcc tggccaacaa ggtgaaaccc 28980 

aggcatggtg gtgcgcacct gtagttccag 2 9040 

ttgaacccgg gaaggggagg ttgcagtgag 29100 

cctacaagaa caaaactcca tctcaaaata 29160 
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aataaaaaaa taaataaaaa taaaaataga 
aaataaatga ataaaataaa ataaaaaaga 

5 

ctcttctgtc atttcagggt aggagtcaat 
taatctctga ctggcatgtt ttctgcttct 
10 aagaataatg tctatttctt tgtttgggta 
tgtaattcag atcatcaccc ctgcactttg 
cattgctgca attatcttgt ctgttctttg 

15 

aaactcgatt tcttgggcaa ttatggaaat 
ctctatgcat atgaggtttc ctaaataaat 
20 cttataactt tctacaacaa agaattattg 
ataccctgct atcgatggtc gctacaaacc 
ttggttttag ttttatgaaa caatttacct 

25 

tttaggaata taaaggatca atatgggaag 
taattagtat ttgtgtagtt caaaccagga 
30 ctttatgaaa gctaaaatat tctctgttat 
tctttactac tgattttttt ttgtttaatt 
aaatgttctc actacaaaaa aagaaaaaag 

35 

tagctggatt gtggtaatca ttttggaatg 
taaatatata taatttttat ttgtcaaata 
40 ttcaaagtta tctactgatt tttataatac 
aattcatcaa gtaaaagaaa ctcctttcaa 
aacatagatt ttggagacag gtcaagttga 

45 

gggtgacata accttgctgt tcctcagttt 
gttagtgcat gtcaacacct tctacacagt 
50 ttagttcctt tataactcat gaacatatgt 
tctttgttta ccatatgttt gtaagcaaat 
acttttcacc aggaagatgg ggggctgagc 

55 

aactatgatt ctgagattct actactgagt 
gctccatgtg actctggtgg agaggtagat 
60 tttcccttac tagagttctg ccatagaatc 
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ctatatatag 


tctttgaatt 


catttatttt 


29220 


atattcagtc 


gttggcacat 


aaactatgtt 


29280 


gaagacatct 


tagtcttaac 


aagttacaga 


29340 


agcttggagc 


taaatgctgc 


tcatgctgaa 


29400 


ttagctctat 


tttgatctat 


agcacagttt 


29460 


taccacatgg 


gcgtattctt 


tcttcccagt 


29520 


tgtaatatgt 


tctgtgtgag 


cctctgcatt 


29580 


tccagtgtgg 


ctgcagttta 


actttgcact 


29640 


gaggagtagc 


atagtttaaa 


atatatatat 


29700 


agtccaaatg 


tcatcagtgc 


tcattttgag 


29760 


aggaaatact 


caagttatta 


tgtgtataca 


29820 


tcatgatctc 


atagttaaaa 


ttgtaataaa 


29880 


caaaatttct 


aaaggcagtt 


tctgttgttt 


29940 


aggatttgac 


tatcattaga 


ttttgcttaa 


30000 


aaaggggcaa 


ctccatctgg 


tcctatagca 


30060 


tgaaaatgca 


aagaattgtt 


aaatgttctt 


30120 


ataactacgt 


gaggtgatgg 


atatgttaat 


30180 


tatatgtata 


tcaaaacatg 


tagtacaccc 


30240 


tacctcaata 


aagatggaaa 


aaaatcgaaa 


30300 


agagaaatat 


ggttaaacag 


taagttagaa 


30360 


aggagggcag 


aagcgtggct 


gagtttaaaa 


30420 


gtttgcccct 


tccctgtcct 


gtatgtcttt 


30480 


cagtattgta 


gaactgttaa 


aagaataaat 


30540 


tcccagaaca 


taaggaatat 


tccataagtt 


30600 


gtaaggactt 


gtttcgtata 


taccatctat 


30660 


tgacaagaga 


gtaagtcttc 


tggatacagt 


30720 


atgctctaga 


attctaaaac 


tctgtgactc 


30780 


agagtcatca 


ctaagggctc 


atctctgagg 


30840 


catgatttgg 


gcggcaatgt 


ttgctcactc 


30900 


atggagtcaa 


catcccagga 


cagaagggca 


30960 
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actcacgtca tcactataaa accaaacgaa 
catagctctc tgctggattt tctgaaggga 
5 cagtcaatag gtcttccaga aggggagaaa 
agatatgcag agattatttt cagggaggga 
atgagaaatt agtactgatc tttgaatccc 

10 

ctttttaaaa tatctcattg aggaagtgac 
taagaaacca taagtggaga ggtttgactt 
15 gcttcaaaac ccatattcaa agctccacag 
tttcatcaca aatgcaccca ggggagagct 
gcaaaaatca ttacttggtg tgccacagcc 

20 

ctgccagaga agttgcgccc tgagaaccac 
caagctgata ggcccactgg gctattctca 
25 gtgaaatgca agctcatccc aggttctcca 
gacaactggg atgagcccca attccaaaag 
gggggctgtt aggataataa taggatactg 

30 

tgaatagttt atagtccctg tgctcaagaa 
ctcaaatctg tctgatataa gattaattgg 
35 aataaaagtg aaggatttga acctgaagga 
ttgaactgag tctcttaaca atctgagaat 
atggagggaa aggtgtgcca tggaggcaca 

40 

cttggatact tgtaacctca tgtacccatt 
tgctctaatc attgtgggct ttggaactat 
45 aagacttccc cttgttgtcc tcacaggata 
tgtcgattag aagctttgga gaaattgcta 
agggtaatga attccataaa tatggccctc 

50 

ggcaaaagaa gcaactccat ttggtcactg 
caggtgctta aaatttgttt ccaaagccct 
55 taccaacctg ggagactaga tgcacaaata 
tgagctcaca agtagagaat aacttaaaca 
cagactctga aagcttaaaa agaattttaa 

60 

ccagagacaa ctttatgagt ggcactcatc 
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actgtattga 


ctgcatttcc 


ctacagacct . 


31020 


gagccaagag 


tcttgggggt 


aagtcctctc 


31080 


agaagtttat 


ttattcatgt 


atgtctgcag 


31140 


gttaattcta 


cttttccccc 


ttcaccttac 


31200 


agcagagggg 


gaagcaagaa 


actagtataa 


31260 


ttcagggtgt 


ggtggcacct 


cccttgaagc 


31320 


ccccttcatg 


ctcttaaagg 


aattgggagt 


31380 


tgaaccaaat 


gataaggcat 


gcaaaacgtt 


31440 


gtatgcacag 


cctccatcct 


ttgtacaaca 


31500 


cagagatgtg 


gcctggggac 


aaaatctaga 


31560 


tgctgctatt 


cttttccctg 


cagattccta 


31620 


tccctagttg 


ggtccatgga 


tgaaggggag 


31680 


ctctatgtca 


tcaaactttt 


taaattttta 


31740 


cctatactga 


gtgctggaaa 


atgtgggggt 


31800 


gtgagacaga 


cgaaaaaaca 


aatgtaaagg 


31860 


gcttataatc 


tgggggtagt 


cagacatgta 


31920 


taacaccagg 


gaagatagaa 


aataataaag 


31980 


tccaggcaag 


caaagtggga 


aggaaacaca 


32040 


gattgacaaa 


tgagagagga 


aggcatttgg 


32100 


ccctgaggga 


cgtggggtag 


cccacagggt 


32160 


tctctcctca 


caggctaccc 


agatcctgct 


32220 


atttgcactt 


aattacatcg 


gtttctccca 


32280 


tccattctgg 


ggagcactta 


ttgtgagtac 


32340 


taaagataga 


tttgaactgc 


ttcacgtcct 


32400 


caggagacca 


aaaaaggcag 


ctcccttcac 


32460 


cactaccatc 


atcattgtaa 


taactactct 


32520 


tgtgtgtata 


gtaagacatt 


tgctcctcaa 


32580 


tcctacttat 


tcctttgcaa 


ataaggaaan 


32640 


tatctagcct 


acctagcact 


catggaatca 


32700 


aattattatc 


actcatcctc 


cttcttagcc 


32760 


tagataatca 


tatagccata 


aatattattc 


32820 



-235- 



WO 02/062946 PCT/USO 1/48437 





ttaatacagt 


gctatttcta 


tccaatcccc 


acagcctgga 


ttcctaattt 


tgtgccataa 


32880 


5 


tatttaagga 


cagactcatt 


agagcttctc 


ctactataac 


atacactcct 


gagaagacag 


32940 


cagtgttact 


gtctgtttta 


gttgttatat 


tctgagaacc 


tagaatcatg 


cttgaggcat 


33000 




agtgtgcatg 


aacgaacgaa 


tgaatcaatc 


aatcaatcaa 


tcaacaaata 


ggtgaaataa 


33060 


10 


ttaataaatt 


aggccactat 


ctctgactat 


taaaaatcca 


cagtctcttt 


tgataaacag 


33120 




aaatactcca 


tgccttttct 


ataatgtgca 


tttcagaaac 


aaccagcagt 


gttttcattt 


33 180 


15 


tcccaatata 


aataatgtaa 


tattcaatat 


gattttcaaa 


ctatacatgc 


catactgtag 


33240 


aaaatcaaac 


agaccaaccc 


ctcactttct 


catttcacaa 


cagaattgtc 


ccagtgtgac 


33300 




tttgtgtact 


ctgcatgtgt 


gaattaaaat 


gagattctgg 


catgctgctt 


ttgaaaactc 


33360 


20 


tacttttcac 


ttaaaagata 


tttactgagc 


acataccgcg 


tcttcaatgg 


catcctaggt 


33420 




gctgagactg 


agaatactgg 


aaaacagctc 


ctctctgaga 


gaagtggtga 


attgttctta 


33480 


25 


gatttagaca 


gtttgtcatc 


taactgggag 


actagacatg 


aaaaagtcaa 


aagtaacccc 


33540 


aaatatttgt 


tcctacactt 


gtgctaagtg 


ctctaagagg 


agagttccag 


gtgcgatgtg 


33600 




agggtttaat 


agaggactca 


cctgaccatg 


gaggtcaggc 


aagtgctttc 


tgcgtagctg . 


33660 


30 


ctcactaggc 


gaagtggaag 


gaaagcatcc 


ctggcaaaga 


taaagcatga 


gccaagggct 


33720 




tgatgtggga 


aagagtaacc 


aaaagaagac 


caatgaatct 


atcgtgatgg 


cataaaatga 


33780 


35 


ggctataaag 


gtagggtgta 


accaggtcgt 


gccaacccct 


atagtcatgt 


taagaaaatg 


33840 


gaacttcaat 


gtgaaagcat 


gggaaagcca 


ctgaacgttt 


ttaaagagga 


tagtgacact 


33900 




tgaagacttg 


ttgaagtatc 


attgtgtttt 


tagagtgaga 


aattaattca 


attataacag 


33960 


40 


taaacctagg 


agaatccaat 


gagaggaggc 


tacaaacaca 


cagatgagga 


aagagtttcc 


34020 




cgaactagct 


tggtggcaat 


ggagctagag 


atggagagcc 


ttgagagtga 


ctcggaggta 


34080 


45 


gaagtgacag 


gacttggcaa 


tccaatgtgg 


ggagtgaagc 


ggagggaggt 


atcaaaggta 


34140 


actttctgat 


ctctgcaagt 


cagctgcaga 


tccttggact 


ctaagaccca 


gatgtcccca 


34200 




aggtccaagg 


ccagttccca 


cagagaaggt 


tcccatcccc 


agaacattaa 


aaaagtaagg 


34260 


50 


gactccacat 


ggagtgacag 


agttctcatt 


ttcctgaggt 


ggtactaggc 


tctacgatca 


34320 




tgggaaatga 


gggactctga 


taactacact 


agatatcagt 


ccccatatca 


gcctttagag 


34380 


55 


gggtcttcct 


tgagaaagtc 


tcccttcacc 


tgctcagaac 


aggcacagct 


cctcctcttc 


34440 


tcttcaaggc 


ccagagctca 


ggaatctctt 


tttggagccc 


cagccggatc 


aaacatctgg 


34500 




tcctctggat 


cttagaggag 


cctgcctgtc 


ccagttggcc 


ctggcagaga 


gcgcttgttc 


34560 


60 


agtgctgagc 


aaatccagca 


caaagtttct 


tcctacccta 


cctacgccta 


ccttccctac 


34620 
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tccctgatgc tcaggtggaa aaacgggatg 
aaatccaagg gaaattatac tgcaaggtgg 
5 ttgtcttctt aacagtttat tcttacagga 
cttctggtaa gccacttaaa ctacataaaa 
tttatgtatg aagcctttag taataaagtt 
ttgcagttca gatcattcct atttgccagt 
cttgttccca tttccctggt caagtaaact 
15 atccctgtaa ttcattgttt gtaattaggg 
cagaaatatt ggacaacatc tgaagacatt 
caagtgagta gaggccaggg acatttctaa 

20 

caacacaaaa attatccaac aaaaaatgtt 
aaccaaaacc aatcctaagc tctcactaat 
25 taccaaccga ggcttttctt tagatcctgc 
aatgctggat tgttctagga ttgaaccaag 
ggaaaagtgc ctacttctaa gggttatttt 

30 

gcaagatgat gcttaaatgt tcatactcct 
tggggatgtg ggcctctctg aaagtcagtt 
35 tcctctaggg atagttacaa ggcatagaga 
catccctgct tatggaaata catgcagaca 
ctgtattgac ttccttccag agctgagaaa 

40 

actcactgag agctatgcag atggaggtgt 
aggcatcaga agacctggct tttcatcaca 
45 cacagtcact tacttcctta attccattta 
ttcctacagc acaaaagtat aatgagaatc 
aaagcccaat acacatgagg gattgattat 

50 

gggagccaaa atagagcctt gagaaaacaa 
atctatatgt tcctgtgcat tttctgccaa 
55 gaggtaaaca ttaagtcaca agtacacttc 
cagaaaggac tggctgtgta aaggcatcat 
atcaataaag gtacagcaaa tacctgatga 

60 

ctttaaaaga gtgatatcag tcctgggagt 
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gtcttggaga 


ttcacaaaca 


gatattttgc 


34680 


tcaaacacat 


cacctgttaa 


cttttgtctc 


34740 


tacctcacag 


taaccgataa 


gaaatcaaaa 


34800 


tttaaaatct 


tctatttgtt 


ttatatcatc 


34860 


acctttcagg 


aaaatctgta 


tgcatcaaag 


34920 


gccactgttt 


gcctgacaga 


cattcttaca 


34980 


aaaagagttg 


tttattggaa 


accattctgc 


35040 


gtaattccct 


gccctgcacc 


ccccaggacc 


35100 


tttggtcatt 


acaactggtg 


ttattggcaa 


35160 


gcatctaaaa 


tgcacaaggc 


agcctccccc 


35220 


tatagtacca 


aggttgagaa 


accttgctct 


35280 


tcttcctccc 


aatcctagcc 


ctatctccaa 


35340 


ccaaggtctg 


gaagaaagag 


tttgggcatc 


35400 


gctatcactc 


acttctgtgt 


taacctgaag 


35460 


gagagtttaa 


tgagacatta 


ctgataaaag 


35520 


ttctaaccac 


tctgatcttg 


tttaaggtcc 


35580 


gaactctgtt 


cttatggtcc 


tctgccagtt 


35640 


cattctgagg 


ggagggcaga 


taaatgtttc 


35700 


atgcagatga 


agccccaaat 


gctgttctcc 


35760 


taaactatgg 


gtaaggaaaa 


cctactcctg 


35820 


gcaggaatga 


tggaggcagg 


atgctgaacc 


35880 


gcccttccac 


aaaccgtgaa 


agtgattgct 


35940 


tctgtaaaat 


gtggataaca 


acaagtgtcc 


36000 


agaagagact 


aggctgggca 


tgttctttgt 


36060 


ctagagatct 


tagtctataa 


aatgtcctat 


36120 


ggggactaaa 


gaccaagaga 


gagatggcct 


36180 


agtgcctatt 


tgcaataaaa 


acctgggatt 


36240 


tqaqqtttqc 


cacaccaagc 


cacaaacttc 


36300 


tgtattttcc 


accacggtag 


tcataacaca 


36360 


atggaaatgt 


gaagcttttc 


tttgtttttc 


36420 


gggtaagaag 


gaagaaatca 


gatgaccccc 


36480 
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ttcctgtaaa ggaactcgga tatacctacc 
aagaatgaat aaagggaatt gggtggggca 

5 

tcaagaaaaa acacctttaa actccagtcc 
ttctgttgat tattgcttga attctcccag 
10 tgtcacctct tttgttcatt tttgtttctc 
ggtgcacatt aaatatttat gaaataaatg 
gacacattca catttttaaa ggcagaaatt 

15 

aggtcctagt gaaacttgga cttgaaatga 
acacattgta ctgcttccag attatgaaag 
20 taaaatgaac attttcttta gcattagcca 
gtttgtcttc tgactttaat gagttagaat 
cctgatggaa caactcttcc ctgagaagta 

25 

tacaagatat ttttgctttg gtttcgatct 
cagggtcaag gtgtcacggg catgaatgtt 
30 actttcacca ttctcagcta cagacatcaa 
gaaatatgtg ttttcagtag aactcttttc 
tccctaaaat cttctctgaa ctgtgtccat 

35 

taaatgaagt tgtgtcagca tttgaactga 
tacaatggtt aaggagcgat attttctgga 
40 ttggcctaat attgtattct ctctaagctt 
tgagaatgct tacctcatgg gttgttgtga 
cagttaaata gtcagtaggt ttagctaata 

45 

tttactgttt cagcattaat gaacaaaatg 
caaaatacag tgggggaagg tatttagaag 
50 ttaagacata cattatttca cataattgtg 
tcttttacca ggaaatgtag ttgtctgtgc 
gtgacattat ttttaagcat gtcagtgact 

55 

caagtcctgg ggttttattt tattcctacc 
ttgaagaata tttgaatccg ttagaaggga 
60 ctcagtaaat aagaaatgag aagttagttc 
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ccacccaccc 


acacccccct 


ccaaaaaaat 


36540 


gtgtgaaata 


tttttattgc 


aacaatgagt 


36600 


aaataggtta 


acatctttaa 


atgttaacga 


36660 


ggaatgactg 


taaaccacat 


gattgcagga 


36720 


cagggcctaa 


tgcagtactc 


acatgtatct 


36780 


aataaatgga 


tactacccta 


aaagttaggt 


36840 


gagattccag 


gagattaggt 


tacttgccta 


36900 


ggtttcattt 


ccaagcaaag 


atatttgcta 


36960 


atggtggtga 


taaaatgaga 


taaaatggga 


37020 


tagagcatcc 


ttagcccatt 


ctaaaacttg 


37080 


ttagtttatt 


aagccttcca 


atcatactga 


37140 


tctggaaaga 


taaaagcact 


aaaaatttct 


37200 


tatttatttt 


atcattttta 


aactatcttt 


37260 


atcagctcct 


tggttgcgat 


aactgggatt 


37320 


gacaagtact 


gccagatgcc 


atcctttgaa 


37380 


attgtaagtg 


gtcttatttt 


gcatgaagaa 


37440 


gatgtaatga 


ttcactgatt 


tttattttat 


37500 


cggttgagaa 


gggtaccatg 


gggttactgt 


37560 


cttgaatcct 


gttcacacaa 


gcttgtcacc 


37620 


cggttttcct 


caagtggaaa 


atgaacataa 


37680 


ggattgctta 


aattaaaaca 


gtaccttgca 


37740 


caattattat 


ggccatagat 


tgaattaaat 


37800 


atctaaaagt 


ttgctcaata 


acactgagat 


37860 


tcatttaaaa 


aaataaggat 


ttttataagc 


37920 


gcaattttat 


tcattaaagc 


agatgagaga 


37980 


aatttaaagt 


atttatctaa 


caagtaatca 


38040 


ccaataggcc 


tttctctcta 


gttccctaag 


38100 


agctatatct 


tgaagcatgg 


gttctataat 


38160 


aataaagtgg 


aatcacagaa 


aatggtcttc 


38220 


ctgaatacaa 


gaggtaattt 


aatcctcaca 


38280 
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atcactcagt gaaatggtcc cagggactct 
agactcatga tagattacta aagaaagcca 
5 taactgggcc aagaacatga tacaacatta 
ggcaactact gtttgttatg ttcgtatata 
ttaaatctcc agatatatga aattccgtat 

10 

attagttacc tgcatacaga accatgaaca 
aacaatttct tgctgtaacc caaacatcat 
15 gatgtctctt gggctgaccc ctgtcctaaa 
gtaatcattt aaaaatgtaa atctgagcca 
tgattaagac ccaaatctta acagccttac 

20 

ttccaacatt attccaggac ttcatccttc 
aatcccttaa tgtgtcaagt atccacggcc 
25 gctttgaacc gtcaccaggc cagttataca 
tgaaatacca cttctttaag gaagttcttc 
agatccccca ttagagtagc gcatagctcc 

30 

tagttaggta aataattatc taattacttg 
gttctacaaa atccaagacc aggtatatct 
35 acaatagggt cttcatacat ttcagtttta 
gtgtattgca gggaaaccaa gattcagaaa 
actttggacc cacttttttt tttttttgag 

40 

gagtgcaatg gcacaatctc ggctcaccac 
cctgcctcag cctcccgagt agctgggatt 
45 ttgtattttt agtagagacg gggtttctcc 
cctcagttaa tcctcctgcc ttggcctccc 
gcgcccggcc aaccttggac tcactgtaac 

50 

tttgagacac cataaggtac catgttactt 
gggcaaaatg gcaatctgtg ggctatatcc 
55 agggaaaaac aaacaaaata ggcattatgg 
tggaattgag taatggatgc ctcctatgaa 
gttcacaatg actgcccagc tcccatggac 

60 

aagacacaag atgaaaattg tgcagtgtct 



PCT/US01/48437 



ctcatgtgtt 


ctgagataaa 


ttctaaaatt 


38340 


gaaaacaaca 


taagcgataa 


aaccctgtcc 


38400 


taattcttgg 


ccattgctga 


atggacagga 


38460 


cgaggaagaa 


cttgtatata 


cccgaaaggg 


38520 


ttcctgctgt 


ccatggatgg 


gtattgcaag 


38580 


atctcttcct 


agttctcact 


tacataactt 


38640 


aagctaaaag 


agccagaaca 


gtgggcttta 


38700 


gaccaatcta 


ttgtctatac 


tgtggctaga 


38760 


ttacttaatg 


gcttcctgta 


aaacgtatga 


38820 


caaaccctgt 


aacacctatc 


ctacctatct 


38880 


tcactcctcc 


acactccagc 


cacttcctat 


38940 


ccaagatgat 


cagaggatca 


gcttcttccg 


39000 


attaactcat 


cctcatactt 


aaatctcaac 


39060 


ctgacctcac 


ctctatgctc 


atatctgcat 


39120 


caatccttct 


tcacagcatt 


tatcacaatt 


39180 


ttattgtctg 


cctcccttac 


catgatgtaa 


39240 


tgttcaatgg 


agaaagtcag 


taaattgcat 


39300 


ccaacaaata 


ttgaatattt 


aagattttgg 


39360 


cacacagtta 


tttgttcaag 


gttaagtggg 


39420 


atggagtttt 


gctcttgttg 


cccaggtttg 


39480 


aacctccgcc 


tcctaggttc 


aagcaattct 


39540 


acaggcatgc 


accaccaggc 


ccagctaatt 


39600 


atgttggtca 


ggctggtcgc 


aaactcccaa 


39660 


aaagtgctgg 


gattacaggc 


gtgagccacc 


39720 


tcaggtcttt 


ctgtcttcat 


agtttaaact 


39780 


ctcaatataa 


acattacctt 


aaagcaaaat 


39840 


agcctgtata 


tatgttctat 


ctggctcttt 


39900 








39960 


tactattact 


ctctactctt 


tccctgttta 


40020 


attggagttt 


gttatccctg 


cctcatcaga 


40080 


gatggctaca 


gaaccatcca 


aaatatgggt 


40140 
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ttcccatact 


tggtgataga 


gtgtaagggg 


ttctcagctt 


atatatttaa 


aataacagct 


40200 


5 


ctcatttcaa 


ataccattta 


tttctcctca 


ttatttgcag 


ggaattttgt 


taatcttact 


40260 


gatcatcagc 


atagcagagc 


tcagcatctc 


tgtgactatt 


gcatccttta 


gaagcaagtg 


40320 




ctggacacag 


tcagatgagg 


tgtgttctga 


tgtctcttaa 


tattattaag 


agaacaggtt 


40380 


10 


cctgttcctg 


ttgggatact 


gcacacaatg 


atgtctgatt 


tattagtagg 


aaaagcagca 


40440 




tgctcagatg 


ctgaaataaa 


acatctaaat 


ttaggaaggt 


cagagagggg 


caagcttaag 


40500 


15 


attgaagctc 


accccaccat 


ctgaaacaac 


tatgctttcc 


tttcttgtac 


tatctgcttt 


40560 


atactcagtc 


atctctgtta 


agacatagct 


ccttatgtct 


ctgccaaaaa 


tatgtaatat 


40620 




cagacttaac 


aggaacccaa 


attactgtga 


catggggtgt 


atattcttgg 


atcataagat 


40680 


20 


gtgtgacaca 


gttccaccac 


cacaaaagaa 


ataggaaaaa 


aagctgttac 


tactgatact 


40740 




acatagtatt 


tataataatt 


gttatagatt 


atatttactg 


aattctgact 


atgtcagtca 


40800 


25 


ctgttataat 


cactgagcac 


caacttaatc 


ttaacaaaac 


tattataaca 


tctttatttt 


40860 


acatatgaag 


gaagtaaggc 


acaaagaggc 


taaactaacc 


tctccagtac 


cacataacat 


40920 




caggattctg 


acccctgtga 


tatgtctcca 


gagcctgctc 


actaagtccc 


aaataccagt; 


40980 


30 


acaggccttg 


tgatttatgg 


agatttcatt 


tctctctcta 


gctctctctt 


ttttttacat 


41040 




caactttata 


gaagtatcat 


ttataaacaa 


caaaatgtac 


ctatgtttag 


ctaccatcac 


41100 


35 


aatcaaggta 


tagtgcattc 


ctagcatccc 


ggaaggttca 


ctggaacccc 


tttgtagtca 


41160 


atgttctcat 


cacctagatt 


ggttttgact 


ttccaaaatt 


tcatattcat 


ggaatcataa 


41220 




gtaagtactc 


tctgtatatg 


gcttatatgg 


cttcttccac 


acagaaatat 


gtttttgaga 


41280 


40 


gtcatccatg 


ctgttgcctc 


tatcaagaat 


tcattccctt 


tatgtatggg 


tacaccacaa 


41340 




tttgtttatc 


aatttacctt 


tttttttttt 


tttttttttt 


tttggagtcg 


agacctcact 


41400 


45 


ctatgaccca 


ggctggaatg 


cagtggcact 


gtctcggctc 


actgcagcct 


caatctcctg 


41460 


gactcaagca 


acctttccac 


ctcagcctcc 


caaatagctg 


ggactgcaag 


catgcgccac 


41520 




cacccccagc 


taaatttttt 


ttgtattttt 


tttgtagaaa 


tagggttttg 


ccacgttgcc 


41580 


50 


caggctgctc 


tcaaactcct 


tggctcaagt 


gatacaccca 


ccatgacctt 


ccaaagttct 


41640 




gggattacag 


gtgtgcgcca 


ctgcgcccag 


tccatttacc 


ttttgatggg 


tatttagatt 


41700 


55 


gtttaaagtt 


ctgaactatt 


atcataagtt 


actaagaata 


tttgagttca 


tatatttgtg 


41760 


tggacatatt 


ttcacttctc 


ctggagtgtt 


gcatctagga 


tttgattgcg 


agatcgtatg 


41820 




gtaaccagat 


gtttaacttt 


ataagaaaat 


accaaacagt 


ttgtcaaagt 


ggctgtattg 


41880 


60 


cttttcattc 


tcatcaacaa 


tgtgtgagag 


ggacagttgc 


tccatatcat 


taacacttga 


41940 
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aactgtcagt ttattaataa tttagctatt ttagtgggta tgtagtggca cctcattgta 42000 

gttttaattt acatttccct aatgactaat gatgatgaca ctcttgctat atgtttatta 42 060 

tccatttgtg tatcttcttt tgagatgtct gttcaaatat ttagctgaat tttattggat 4212 0 

tatcttctta ttggccttat agcagcctaa ttaattctta atacaaatcc tttacaaata 42180 

ttttttcaca ctgtgacttg ccttttcatt ttcttaatag cttctttgaa aaaaacaaat 42240 

ttgatgaagt tccatttatc atttttgctt ttatggttat ggtttttgtg ttctgtttaa 42300 

caggttttcc ctacccccaa atcatgaagg ttttcttcta tcttttccta cagaagtttt 42360 

15 ataattttag cttttgtgtt tagatgtatg gtctatctta actttcatgt acagttgtag 42420 

ggtaaggtta gaagttaatg cacatccccc ctcatggaga tatccagttg ttaacgccat 42480 

ttgttgaaaa tgcaatgctt tcctccatca ccttggtatt tttgttgaaa atcaatttac 42540 

ttattattta tttgtaggtc tcttttggat actttttccc tctggccact acgtctatcc 42600 

ttccacgtat tctacactct cttcatttct gtgtctttag agtaattctt ttgcattgtg 42660 

25 gtgaaatata tataacataa tagttatcat tttagtcatt tgttcaaatc attgaataca 4272 0 

ttaataatgt tgtgtaactc tcaccactat ctatgcccca aaatttttca tcatccccaa 42780 

cacaaactct gtaccacaaa acaatcactc cccgattccc ctcctgattg cccctggtac 42840 

cctctatcct actttctata ttatgaattt gccttttcta gagacctcat ataagtggaa 42900 

tcttacaata tttgtccttc tctaactggc ttatttgagt aagcataata ttttcaagat 42960 

35 taatccatgt tgtagcatat atcaaaattt cattccttct atgactaaat aaaattccat 43020 

tttgtgtata taccatattt tgcttagcca ttaatttgtt gatggacaca gatatttctg 43080 

ccttttgcat attgtgaata atgctgttac gaacattggt gtaaaaatat ctgctcaagt 43140 

ccctgttttc aattcttttg gatacatacc taagagcgaa attgctggct tatatggagt 43200 

tctgtgttta atgttttgag aaaatgccaa attgtttttc accttcgttg caccatctta 43260 

45 catttccact agcagtgtac aaggattcca atttctccac atctccatca ataccaacca 43320 

acaggggaaa agggggagta attattttcc atttattttt aattttcaat ttatgattga 43380 

cacataattg tacatattta tggggtacag tgtgatgctt caatgcatgt tacattttct 43440 

aatgatcaaa tcatggtaat taccacattc atcactttaa acatcatttc tttgtagtga 43500 

tgacattcca aattttctct tctagtcacc tggaaatata cactacgttg taatttgcta 43560 

55 tagtcatgct actaagtata gaacttattc ttactatcta tagtaagtaa gaacttactc 43620 

ttcctgtcta gaattttata cccattgacc aatccttccc ggtcctccca tcctccctac 43680 

cttccctagc ctctggtaac cactattcta gtcttcactt ccatgaagtc aactttttta 43740 

gattccaaat gtgtgtgaga tcatgcagtg tttgtctttc tcagcctggc ttatttcact 43800 



30 
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taatataatg tcctccaggt taatgcatgt tgccacaaat gacagaatta cattctgttt 43860 

tatggctgaa tagtattcta ttgtttgtgt gtgtgtgtac attatatata tgtaatatat 43920 

5 

aatgtatatt ttatatatat atatattaca ttttctttat tcatttatgt gtagataggc 43 980 

atttaggttg attccttatc ttggctcttg tgaatagcac tgcagtaaat atgggagtac 44040 

10 agatgtttct ccaactgtct tcatttcctt tgaatatata ccctgtaatg ggatttctgg 44100 

atcatatttg tttttatcat ggccatcttt tttctttctt tctttccttc tttccttctt 44160 

tctttctttc tttttctttc tttctttaat ttttatacaa agactatgtc acttaggcta 44220 

cagtggcaca agatcatatg tcacttaggc tacagtggca caagatcata gctcactgca 44280 

gcctcaaact cctgggctta aatgattctc ctgtttcagt ctcctgagta cctgggatga 44340 

20 aaggcatgtg ccaccacacc cacataaatt tttattttta tttttatttt tgtagagaca 44400 

aggtcttgct atgctgtcag ggctggtctt gaactcctgc ctttaagcca tccttccacc 444 60 

ttggccttcc aaactgctgg gattaaaggc atgagtcacc acacccagcc ttatataatg 44520 

gccatcttaa tgggtgtgaa gtggtatctc atacttttga tttgctttta cctaatgatt 44580 

tgatgttgag catccatgta tacatcttct ttggagaaat aactgttata gtcttgcctg 44640 

30 tttcttaatt gggctgtttg ttttttgttg ttgaattgta gagttcttta tatattctgg 44700 

atcttagtcc ctcagtagac atatgatttg caaatatttt ctcctattct gtaggttgtc 44760 

tttgcacttt cttgatgact tttgatgcac aaaagttctt tatgaagtat aattaatcta 44820 

ttattttgtt gttttctgtg cctttcttgt cacatccatg aaattattgc caaattgtca 44880 

tgaagatatc ttccaacttt tttccatgaa atgtatggtt ttagctctta ggcttagtgt 44940 

40 cggattgatt ttgagttaat gtttgtatat ggtataaggt aagggtccaa ctctattctt 45000 

tcacatgtgg atgcccagtt attccagcat catatgttga aaaaattcct ttccccattg 45060 

aatattcttg gtgtctatgt tgaaaattga . tttactatat atgtgatata agtgagcatt 45120 

tatttctgaa tgattccatt ggtctatata tcttccctta tgccaggtac catatttttt 45180 

gggttttttt ttttgtttgt tttttgtttt ttgttttgat ggagtcttgc tctgtcaccc 45240 

50 aggctggagt gcagtggtga gatctcagct cactgcaaac tctgcctccc gggttcaagc 45300 

aattcttctg cctcggcctc ctaagtagct gggactatag gcgcatgcca ccatgccagg 45360 

ctaatttttg tatttttagt agagacgagg tttcaccaca ttggccaggc tcatctcgag 45420 

ctcctgacct caggtgatcc acccgcctcg gcctcccaaa gtgctaggat tacaggcatg 45480 

agccactgtg cccggccctg tattgtttta attagtgtag ctttacagta cgtttcaaag 4554 0 

60 ttggaaagta taatcactca aactgtatta ttctttttca ttattgtttt ggctattcag 45600 
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ggcctcttaa 


aattcaatat 


gaacttgagg 


atccattttt 


ccatttataa 


gaaaaagact 


45660 




gttggaattt 


tgatagaaat 


tgtattgaat 


ctgtatattg 


ctttgagcag 


cactgacatc 


45720 


5 


ttaacaattt 


taaatcttct 


tatacgtgaa 


cacaagatgt 


cttttcattt 


atttaggtct 


45780 




ccaattacat 


tcagcaatga 


tttatatttt 


ttagcataga 


ggtctttcat 


cccattggtt 


45840 


10 


agatttattc 


taaagtatgt 


aattatttta 


ggtggcatta 


taaatgaaat 


tgctttttaa 


45900 


atgttatttt 


catattgttt 


attgctgtta 


tacagaaaca 


acaatgattt 


tttgtgggtt 


45960 




gctcttgtat 


cttgaacctt 


gttgcaatcc 


tgtattaggt 


ctagtagctt 


tcctgtggat 


46020 


15 


ttgggaggat 


tttctatata 


ggaagatcat 


gtcatctgga 


aatacagatg 


gccttaatct 


46080 




cctcttttct 


aatttggatg 


aattttattt 


ctttttattt 


tctaatagct 


ctagctagaa 


46140 


20 


cttttagtac 


aatatttaat 


agcagcagtg 


aaagaaagca 


ttcttatata 


gttcctgatc 


46200 


ttagggggaa 


agcttttaat 


ctttcaccac 


tgagtatgat 


attagctgtg 


ggtgttttat 


46260 




aaataaactt 


tacatattga 


ggaattttcc 


atctcttcct 


agttttatga 


gaggttttct 


46320 


25 


catgaaaggc 


atttcatcaa 


atgccttttc 


tactccaagt 


gagatgactg 


tgtgggggtt 


46380 




ttccaacatt 


atattaatgt 


aatgtattac 


atttattgat 


tttcttattg 


taccatcctt 


46440 


30 


gtattttgct 


tatcaatttc 


acttcatcat 


atgaataatc 


cttttctaat 


attttgttgg 


46500 


caatttttgt 


atctacactt 


ataaaggata 


ttaatctgta 


attttctttt 


cttgcagcat 


46560 




aattatctgt 


ttttggtatc 


aaggtaatgg 


tggcctcata 


gaatgagtta 


ggcaattttc 


46620 


35 


ccttctcttc 


aagttttcag 


aagagtttga 


gaataattgt 


tttttttttt 


tctttaaatt 


46680 




ttggtagaat 


tcactggtga 


agccatctga 


tccagaagtt 


tcctttgtta 


ggaactttta 


46740 


40 


gattacgaat 


ctttttactt 


gttatggatc 


tgtaaagatt 


ttctatttct 


tcttgaatca 


46800 


ttttaggtaa 


tttgtgtgtt 


gcaaggaatt 


tgcccatttc 


ttatacagta 


tctgatttct 


46860 




tggcatataa 


gtttttatag 


tattctttta 


caatacttta 


tacttttact 


ctctataagg 


46920 


45 


tcagtaataa 


tgtcttactt 


ttatgtcttg 


ttttaattgt 


tttattgtct 


ttttgctttt 


46980 




cagtataggt 


aaaggtttat 


caattttgtt 


gatcttttaa 


aagaatcaac 


atttgggttc 


47040 


50 


attgattttc 


tctattgttt 


ttcaattctc 


tatggagatt 


tctcttctgc 


cccactaaaa 


47100 


ttatgctaga 


gtaattcttg 


aaatcagata 


gtataaatct 


tccaactttg 


ttatttttca 


47160 




aaattgtttt 


ggctgttcta 


ggtcatatga 


atttccacat 


aagttttaga 


attcgtttat 


47220 


55 


taattactac 


aaaaat cttg 


ctgggatttt 


aattaaaatt 


atactaaocc 


tatagataat 


47280 




ttataaacaa 


tttacttctt 


agtaatatgt 


agtcttctga 


tccatgtcca 


tgaaacatgt 


47340 


60 


ctgtattttc 


ttacatctca 


tttcatttct 


ctcagcacta 


ttttattatt 


tacaatgtat 


47400 


aagccttatg 


catactttgt 


taaatgtatc 


tgtaattatt 


tcattttttc 


ttgatattgt 


47460 
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acatggtatt tttaatttta ttttttaaat 
tttttatata ctggtctact atgacctttc 

5 

ttttaataga gtttctagaa ttttcttagt 
tttttactcc caaactttat gcctcatttt 
10 tacaatgttt aaaataagtg atgaaaatag 
aaaatgttca atattttacc cttaaatatg 
tttaccctgt tgcgcagatt ttttaaaatc 

15 

atctcatgct ttcatcttta tcgtattatc 
accctgtttt aagcccttta gaaacaatac 
20 ttgaatttag tgaatacaga tctttaatcc 
caggggattc aaaaataata aaatatgtac 
aactctacat ttacggagaa atttctctgt 

25 

atttggtctt tagtcttagc caccctgaag 
agcatagatc agagaggaaa gcacaggacc 
30 ccaagtaaac agctctacga tactggtctg 
tcaattctaa aagaaggata ataacttcat 
ttggaacatt gtctagcatc tggtaatcac 

35 

aaatcatagc acttaagagc tttaataaac 
ctttttaagt ggaagaataa attaaaattc 
40 atatagaaat ctcttccaaa gtaccattta 
aaaatatagg tgtgaaaaac taattagaag 
atgtgtagat tctcaatcta ggcctgctat 

45 

ttaatcactg agttctgatc cacttcatca 
attatctcag aaaggtgtaa ttcatggagt 
50 taacaacctc ctaaaaatta taataataaa 
tatcagatgt ggtcacttag ctgtttaaaa 
gaaagtccaa atttcctagc ctagcatacc 

55 

taccatctgc catcacacca tgctatatca 
ctgctgcttc tacctggatt ctgactcctg 
60 taattcctac tcagctgtta agatatagat 
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gttcattccc 


acatatagaa 


atgcaattta 


47520 


taaactaact 


tattagttcc 


catagctttt 


47580 


agataaccat 


tttaagtatt 


aataaagaca 


47640 


tcttgcccta 


ttacactgtg 


tagtctccaa 


47700 


caatctttcc 


ttgttcttga 


tcttagggtg 


47760 


atgttttttg 


taggtttttt 


acagagtaaa 


47820 


atgaatgggt 


attggctttt 


gtcatttttc 


47880 


tgtatattac 


acaactagag 


gagagtacaa 


47940 


taaaataagt 


tacactaaat 


aactcttcaa 


48000 


actaatggca 


cagtaatgtg 


caaagtacaa 


48060 


cttgatctca 


agaaaaggaa 


agggaaagtg 


48120 


gccagtttct gtgcttgaaa gtattatttc 


48180 


taggaagtag gtgacgttat 


taatacaaga 


48240 


tcagattcaa 


ctacaatttt 


taaatccttc 


48300 


gttatttaac 


atctctgaac 


ctcaattccc 


48360 


agggatgtca 


taccaatttt 


tgtgatgact 


48420 


tgagttctca 


ttaatggatg 


aggaaattta 


48480 


agagaaacta 


taattatttc 


ttccaaacct 


48540 


acatgaaaca aagtttgagg 


attcttagaa 


48600 


tactcacata 


aatagtcctt 


caatagggat 


48660 


aatgggaaat gcaatgtatt 


atttcagaga 


48720 


tagcaaacga ggtaagcata 


attaattcat 


48780 


aatagaacaa gtagtctatt 


ctgactatga 


48840 


cagaataaaa agcagcaggt 


tgtgctgcag 


48900 


atgagaacta atgggaagca 


ttgcaggttg 


48960 


aactccaatt gcttcctctt 


atttgtaaag 


49020 


atctaataat 


atcagtaata 


tctaactgct 


49080 


caaccttgtg 


tttttgtacc 


cgctattcca 


49140 


tcttatttca 


tcttaccgtc 


tatgcctgag 


49200 


taaatattgc 


ctcttctgta aagcatttcc 


49260 



-244- 



WO 02/062946 

aggttgtctc agaattaggt catcttctat 
gtaattgcat ttgctgaatt gtattactat 
5 tgtgagcacc ttaagggaaa atcccacatc 
atctgggtga tcaataaact attatagatt 
atataaaatg tctcagaagc tttataatat 

10 

taatcaaatt cctctgttca atatttgaaa 
ccaataagtt aatggtaaag gcaggactgg 
15 tatttcctat ctactgtctt aagtcagaaa 
tcagacatga ttctgtcata ttaattaccc 
ttgccttcgg atgttactca aaatagtgaa 

20 

caatttgtgc ttcaagaaga gttttccagt 
atctttggag gctatgcttt cttcaagtta 
25 ccaggtaata aaggtagaga atttgtgcca 
cccaaatttt cagaggaaga aattgaacct 
gaaaatatgt ccattcagct agactctaca 

30 

tctgctattg tacaaccttc tcaaatgcaa 
ctccaagttt ttccatccca ttctgcacta 
35 ccatcccaag ctctaccagt agaaggcctg 
tcatcccatg tcaaacagtc ttctaatctg 
ctatcccaag acacatcatc tcaagatatg 

40 

caatccctag atatgctatc tcaagacaca 
ccttcccaag atatgctatc ccaagctcta 
45 acatcccata ttgtgcagtt ccctgaaata 
caaccagaaa acactgaacc tcaaaaccag 
agatcagaag ttatggaaga gaccaaagaa 

50 

aaatcctcaa gacggcattc cttaaaccag 
cattctttag acgtgcaagc caaaggccag 
55 caaagcaaag gctggcaatc tccaaagcag 
ctatccccaa agaggcactc cgtagataag 
ctcccagatc agcaagctga agatcagcaa 

60 

tctaaagatg gacaagttaa agaccagcag 
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gttcccataa 


agcactaagt 


acctatttat 


49320 


tatctattta 


tatgttgatg 


tacagatagt 


49380 


taattgtctt 


tgattcccag 


catttagttc 


49440 


aagcatgaat 


atatgaatat 


tgactagatg 


49500 


taagtatggg 


aggaattttt 


gaggcaatct 


49560 


gactaaaagg 


tgatgacttc 


taaaataata 


49620 


cattcgtttt 


ctgccttctc 


atttagatgt 


49680 


tgaaaagtaa 


cttggaaaac 


aaaaaaaaaa 


49740 


tcatccttgc 


ttttataggt 


tctgtttttc 


49800 


caacctgccc 


cagaagaaaa 


tgatcaatta 


49860 


gatgattcaa 


caacaaatgc 


acaatctgtt 


49920 


acactctcta 


ggagtccttt 


agtctcccaa 


49980 


gatgaacaaa 


agcaaagtat 


ccttccatct 


50040 


ttgcctccca 


cactagagaa 


aaagccctca 


50100 


tttaaacaaa 


tgaaagatga 


agatctacaa 


50160 


accaagcttc 


tgcaggacca agctgcgtca 


50220 


aaactcgaag 


atatatcacc 


tgaagacttg 


50280 


tcagaacaaa 


ccatgccatc 


taagtctaca 


50340 


acagctaatg 


acctgccccc 


tcaaggcata 


50400 


ctgtttcatg 


acatgacatc 


ccaagatatg 


50460 


ccatcccacg 


ccatgccacc 


tcaagacata 


50520 


tcagcgcatg 


ccatattacc 


tgaagcctca 


50580 


caacacctac 


ttcagcagcc 


cccagatctt 


50640 


caaattttac 


aaatgtcata 


tcaagatatt 


50700 


tggaaatctg 


aggaggaact 


ccatagaaga 


50760 


caaaccaaag ccttgcaata 


cttaaggaga 


50820 


aaatcctcaa 


agaggcattc 


cttagatcag 


50880 


aaatccttag 


accagcaaat 


caaagactgg 


C 0 Q4 A 
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caagctcaac 


ttaatcaaac 


taaagagcaa 


51000 


gccaaagggg 


aacaataccc 


agaaggacaa 


51060 


actgataagg agcaaaactc aaagaagcaa 


51120 
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acccaggatc agcaaactga agaccagccg 
ttccaaaatg ttcaagccga aggacagcaa 

5 

tgccaagatt cagaatccca aatacagcaa 
ctccaggctg gacaacccag gactgtcaat 
10 ctcagggctg gagaaacaaa gattataaag 
ggcaaacaca ggatctatta gagaaagaag 
tccaaaccca gcacgcaaca gcccaacata 

15 

accaacaaga ccttcaatcc agagttacac 
aaccagggga catgaaatgt atagggcaaa 
20 aggcagattt tcattcttct tctggccaaa 
tgtccaatct agattcagaa caagatgtgc 
ataaagaaga tgtgaattta acttctactt 

25 

aagactctga ctaacatgca gaatctaccc 
aattgggaaa aacaatattg cctcctccaa 
30 ttaacttaca aaaaaatgaa gggcatgctg 
taaaatgaca acaaaccaaa tgttaacact 
attcacatgt atattccttg tcatcaaagt 

35 

tttataactt aatgtttaat ggccctactt 
cttttggcat tataggagtt cagaaaatca 
40 gggttcaaga accaaaagag tcagcaagga 
tgtttccaat cctcttattt actctccctc 
agtttgctca attattaatg aggtagaatg 

45 

tgggagggat taagagatat ctagagacag 
cagatgtcat aatagaaaat atgacatact 
50 tctttatgcc tgcatttact gattataatt 
tctgtcatca aaggaaattg tgttatcaaa 
aatcgtgttt tttcaatcct ctttcccttt 

55 

cccttttctt tcaattttta tcacccccat 
aaaattctat aataggtatt tctagggatc 
60 acaattcact attacagttc tctcaccttc 
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gcccaagaga agaaatcccc 


gaaaggacaa 


51180 


gctcaggtgg agaaagtgcc 


aaaactgtta 


51240 


taccaattct ggcaattcca 


caaaggcaat 


51300 


cttttggcca agaatcccct gactggataa 


. 51360 


cacgagaatg gcaatttgaa 


atgaagcact 


51420 


ccctaaagca gaaagctcta 


taccaagaag 


51480 


acctagaatg tcaagacact 


caagataaag 


51540 


aaaaaggaga tatgtacact 


agagacatca 


51600 


cctcagggga cctgcaatca gaagacgtga 


51660 


gctcagtaca agacacatgt 


ttagcctatt 


51720 


aaccagacac ttcagcttcc 


tcaaattcat 


51780 


catgtgatcc aaaagatcaa 


cagcaatctg 


51840 


aataccacac tgcccccatt 


aatggaatta 


51900 


tctgtgttct caactgtggt 


tgccacctca 


51960 


agcactcaaa caatttgttc 


ttacttaaaa 


52020 


gtatcatcac tttatgtatg 


tgaagaaata 


52080 


tagtgtttca gtttataatc 


acgatttcat 


52140 


tggtatgatg ctagtatttt 


tcatctcccc 


52200 


accacaggga agtctatgaa 


ggctctctaa 


52260 


acaaaaattt ctcccatttc 


cctgtcttgc 


52320 


ctcatattta tgttgaggat 


atatggactg 


52380 


tggcaatgaa tacagaatgt 


gatgatggag 


52440 


gaaagtagaa attagggaag aacagagatg 


52500 


gtattgtatt cagtgaagct 


tcagctccct 


52560 


tctcaaccct tatgatacat 


aatttcatcc 


52620 


agctatcagt tacctcaatc 


ttccaagcct 


52680 


atctctgata catttggaaa ggttaacaat 


52740 


taacttctgt gttttgcact 


tcctgtttca 


52800 


tctccccaac tttattcatt 


ttttgtaatg 


52860 


ttgtgaataa tccagactgt 


gtcatatttg 


52920 
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tgtcttctac agaatttcag cacaaaggct 
ttaagggcat aaatgggagc ttggagtctc 
5 agtagtataa taagatacag agctccttat 
ctttgtagta aaatattagc ttcctgagct 
ttacctgagt tagcctttag ttcctgaatg 
ataatactaa taatttctac cttttccttt 
tgtagagttt aactttcagt ctacaaactt 
15 tatatatata tatatatgta tgtatatata 
tatagagaga gagagagaga gtgtatatat 
ttgtatcaaa ttctttgaga atccagccca 

20 

tctacccaac atacacaaat acatttctgc 
tagtgccaag tcactggaga ggcaatttcc 
25 actgagaaac ttataggcca agttcaaata 
ctatgctaca tttggaacat ggaaattttt 
ttagacttac aatatgagaa tggaagaatg 

30 

tcccactgag ggagatgcta aggggcaagt 
gtaacctttc tagtctacag aagactctct 
35 ttgacctccc ctttctgaga cacctaatac 
gacaaaatca aatgtgttta agacccataa 
aagacaaaat gcaaaaggct gtgggaaact 

40 

taattttttt attaaaaact gtcttattac 
aggcttcttt gtagtaaaat tgggaaaagt 
45 agttcccaag aactgtggaa aaaaaaataa 
ctagaatcaa caacctacag ctaaattata 
agatggtaac tattgccaat ggaatgtaaa 

50 

ctacaatatt tttttccctc atgcactggc 
cctgagtctc tagagaatgg agcctggtcc 
55 gaatattcaa ctggattgtg atctaaaaga 
agctgtttgg ctagaaggta gtctaagctg 
caaatcttgc tcaaaatggc aggactccta 

60 

ctgcataaca aattaccaca aaggtagctg 



tgattcaaga 


atacattcaa 


taaaacattt 


c o o o rt 

52980 


tacttagagg 


tgaacagaga 


agttatatgt 


53040 


gactggctac 


atgttccttg 


ctgcctgtgc 


53100 


tcacatgatt 


gcttgacctt 


gatagagtcc 


53160 


accaccacct 


ggtattttaa 


cttactctga 


53220 


agctagatta 


gcacgtttct 


acagtgctta 


53280 


gggtgtttag 


gaagtatata 


tatatatata 


53340 


tatatatata 


tatgtatgta 


tatatatata 


53400 


atatatatat 


atacacacac 


acacacaatt 


53460 


cacagttact 


cttgccagtt 


catccctata 


53520 


agaaatgtac 


aaagatcctg 


gaatcttaaa 


53580 


atcatcagcg 


actagtccac 


accagttacc 


53640 


taccgtatgt 


gtcctgtgaa 


cttgttgctt 


53700 


ccaaaattgc 


aagaatacaa 


cgcctatagg 


53760 


taaaaaacca 


cttcttctga 


ggcctttcca 


53820 


cagaagctcc 


caaggccccc 


agaacccata 


53880 


tttctctctg 


tttcttgaag 


gcattttccc 


53940 


tgtcctatca 


atgagtttgg 


gtctgaagtt 


54000 


ggaaaaataa 


tgtttgaagt 


atctgtttaa 


54060 


agaaaattta 


tgatttgctt 


aaaggcaacc 


54120 


aaatataata 


tgtgtttcct 


agaatatgtc 


54180 


tgatttgaga 


tactgctacc 


ttacacactc 


54240 


gctggctgct 


tacaattgca 


ttttcattta 


e a ^ f\ e\ 

543 00 


cttcccagcc 


accttgcatc 


taggtaggtg 


C A "3 c r\ 


gtttgactcc 


taaaatcacc 


aacattatat 


54420 


tagatgcaga 


gaatctagga 


tccacaagac 


544 80 


ataaatcgtg 


gtgtggatta 


cttccagttt 


54540 


aaaacccata 


ttagggcaag 


gaatgcatgg 


54600 


agtaatagag 


tctttaaggc 


acaaaaaagt 


54660 


ctcgtatctc 


cttgtgttag 


ttacctattg 


54720 


cttaaaacaa 


caagcttatt 


atctctcagt 


54780 
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tcagaagccc aggaaagctt ggttaaatta 

gaggactggc cagggcagtg gttctggggt 

5 

ggaagaatcc agtcctcacg gctgcaggac 

ctgggaactg ctcctgcatc ccttctcatg 

10 tgtgtcaagt cttctcaaac ccaaatctga 

ccttttatta cactagactt actggaataa 

ttaataacct taaatttctg tgcaaagttc 

15 

aaagattatg ggctcccata aatagtcaca 

tgttttccta atttcttaga aacatgtttg 

20 gttttaaaga gattactgtg gctgcagttt 

gcagggagtc aattaggaaa ctcttaaaat 

gatgtataga agagtagaga ctaaaatgat 

25 

agaaactaga ggtgtagaat tgaagaaaat 

gattagctgg ctgagacatc attaccatca 

30 caagagtaag gggcatagtc agttctttct 

gaaacagaac caagaggatt agacagatag 

tagatatata gaagatagat agatagatag 

35 

gatagataga tagatagata gatagataga 

ataaggctga gaagtcccaa gatctgaaag 

40 tccagcctga gtacgaaggc ctcagaacca 

tgaaggtagg aaaagaccag tgtcccagct 

ttgctcaggc tttggtttta tttaagccct 

45 

ttaccaccgt tcatggaata agatttaaag 

aaagaaaact agaggatctg gagactccta 

50 tgctctagct ctgctcccag agttgttctc 

gaaggcttta acagcagaga aaaataagag 

gctgcagtaa cagaagaatg agagattaaa 

55 

gctttattca gtttaccaat gtaaaaacct 

gaataaagtt taaccaaaag tcttggcacc 

60 accatcaaaa ggccatcctt tgtgaacctg 
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tttgtacaga 


atatcacatg 


gctaaaatca 


54840 


tgagctcctc 


ttcaaagttc 


atccattgtt 


54900 


tgaggttccc 


atttccttgc 


tggctgtcaa 


54960 


tgatcccttc 


catcttcaac 


caggtaacag 


55020 


ctccctctcc 


tgcatctggt 


tttaaggagt 


55080 


tccaagataa 


cctattttag 


gataaattga 


55140 


cttttctcat 


tgaagataac 


cacaattagg 


55200 


tggaaaaata 


cacacaattt 


taaaaattaa 


55260 


acttttaaag 


caggaactaa 


ctgaattggt 


55320 


ggaggatgaa 


ctggagaaag 


aaagatggca 


55380 


agtgaagaaa 


gaggtgctaa 


gtgatgtagg 


55440 


aattcaagag 


taaaattgtc 


aggatgggac 


55500 


gaagaataaa 


ccatctgctc 


tagcttaaaa 


55560 


acaaaacaga 


cgaagaggca 


caggttgcag 


55620 


agtgtagtgt 


attaggcagg 


gttctccaga 


55680 


acagagagct 


atagatagat 


atatagatga 


55740 


atgatagata 


gatagataga 


tagattgata 


55800 


tagatagata 


aggaattggc 


tcacatgatt 


55860 


ctggagaccc 


aggagagctg 


atggtatagt 


55920 


agagtttaaa 


ttccagttca 


agtcttagtc 


55980 


caaagacagg 


cagagaaagc 


aaattctctc 


56040 


cattggagac 


attgagagta 


acactgggtt 


56100 


acactattgt 


tatttataat 


taaaaaaaaa 


56160 


agcaggtgac 


atcttgccac 


aacagttaac 


56220 


tccagagtat 


ggatatggat 


ctttttaaag 


56280 


aaacttagaa 


acaaagaaaa 


ttgactttca 


56340 


tgaggcccac 


ccatattggg 


gagggcatct 


56400 


catccagact 


catcctcata 


gacacactaa 


56460 


tattgttcaa 


tcaagatgat 


gcataattta 


56520 


acactcatat 


gcatctcctt 


aagccatact 


56580 
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taatctccaa ataaagacta aaaggtgata 
atacaactgg aaatgtacca acctcttctc 
5 ttactcttct ccttactgtc tagcaacttc 
caatactata atataaacat gccttatgtt 
gtgtgtgtgt gtgtgtgtgt gtgtagtttc 

10 

ttataactac cttattctac tacccattct 
gctggccatg gttctttacc tggtagaatg 
15 caattagtgg tcctgcctgg actgagttgt 
atcatggtaa tactaagaga tgcatgaagc 
ttatgtccat tgtggaatag tggtctaatt 

20 

tccagcacag taaccccttt cagttatctg 
cagcaataaa cctataggag cctaccttcc 
25 attgacactt caagcaccat tggatctgtt 
cacggaagcc gaggcctgtt gtgaagcttt 
aagtcacttg gcaaattggc cagagcaaca 

30 

atccaaagaa actcattaga tgctgtgcct 
cttatcctca tgctggaaga gattctcaat 
35 tcactgaggt acaagaccct gaggttttat 
atatcttatg aataaatcta gagtagttac 
aatgtcatca acctaatgga ccagtgtgat 

40 

cctgtgaact aaattatgac atagagctat 
gaatgtgtat tgctggcctt gtcatctgaa 
45 agctgtccag aaaaagcatt tatcagatca 
ttaatttgct caagtaatga aaccacattt 
aattggtaat ccactgtcat cttccaagat 

50 

aagttgaata ggaatgtggt gggaatcacc 
tcattaatct ctgtagcccc tccaggaatg 
55 agttactgat ggatttatta atttctgcag 
ttactatttt cctaggcaga ggcagtacta 
ggtccttact ccacagagta agaaaataat 

60 

gttttaatta tgcattctgg aattggggga 
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attcttcata atatgttaca actatcatgt 56640 

agaagaaggg gtaaagtctg tgaatgatgt 56700 

tatgatttaa attgtatgat gtaaagttaa 56760 

acatgattag agaatgagaa agaacagatt 56820 

tataactgga tatgtggacg taactcatat 56880 

gtagtccctt tgccttcagc aaacacctta 56940 

acccaaacct tcattcctga aaagcctgga 57000 

tgtggttttc ttaattgacc ttaatcacag 57060 

gatctcctat attccaagca tagtgttccc 57120 

tccccttagt catcagaaac aatcacccca 57180 

aagaggatgg aagggctttg ctccaaaatt 57240 

cagaagctct atatagcatc cctatctgcc 57300 

gatggtatgg tccaagcaga agtcagctta 57360 

tcttctggtc ctactcaaaa gcagctctgt 57420 

cacccaagtg aggaatatgt tgcctccaaa 57480 

cctattcatt gtggaagcca gatgcgacaa 57540 

attccccaca ccacttgatc cctagaaatt. 57600 

cagatttatt ttccacactc tggcatgtaa 57660 

tacgtcttat tcgctaggtc caatcagcat 57720 

atcttgtgga aggaaagggc aattaagatc 57780 

aaagttgata tacccccgag gtatgacagt 5784 0 

agcaaactgt tcctggtggt ctttactaat 57900 

ataacattat actgggtact agaggatgtg 57960 

ggtaaggcag ttgcaattag agtcacctcc 58020 

ccatctgtct tctgcacatg ccaaacatac 58080 

acccctgcaa ctttcaagtt actgatggtg 58140 

tgttgggagt caccaaccct gcatctttta 58200 

taccatcagg aatgtggtga tgcttttgct 58260 

gaacttccac ttggtattcc ccaccataaa 58320 

atgagatcct accacctatt gagtatattt 58380 
ataaccgcag gatgggtttg ggagcccact 58440 
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gcacccattg tgagattaac cagagataaa 
cttactctga gtggtaagcc acagtgatgc 

5 

gaatcagtgt ctagtagtcc ctgagaggtc 
ccctggaaaa aggctgcagg tccctttgag 
10 ttcttggtaa tgcaccaggt ccttccttaa 
gttctttctg ggtttgtaaa ttttattgca 
gattgagggg ctatgagtct atatttttat 

15 

agaacttttc tgcttatata gatcaagggg 
tctaggaaca ccataatcaa ctagcccaca 
20 tcttatccat acccaccctc acagacacag 
cagtctatga cctggtcaag ttgacacata 
cagtcttggg gccacaatct ggaatgagtc 

25 

agctgctggg atgcaaatgt atactaagca 
acaaaaatgt attattttaa aacatggatc 
30 tatttatgca acctctcact tgtggttatc 
tttctgaaag gctgttactg ttagtgaacc 
cagatctcct ttgcagccat aatatcctcc 

35 

ggaaattctc atctctcact ctgcttcccc 
tccctgtggt ccctgtgatt tttttttttt 
40 aaatgtggag tttctggtca gagagttaaa 
tgtcaaggca aagctctgaa gcatcaaatt 
ggatcataat tttgacccag caccattact 

45 

tttccgtgcc taagttgagt agtactcagc 
agaatgtata gtgtttaaaa tatctgtcac 
50 aaacatctat aaagctaatt cttttactag 
tctaatttat gccaccaaag tatctctctt 
tattgagtga actgcatccc actctaaatt 

55 

tatctggaga cgggcttttg aagctgttgg 
catgacggga ttagtgtcct tatatgaaaa 
60 ctgtgtgtgt gtccttgtgt gtgtgctcac 
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actccatgga tcatcggacc 


tccacaacca 


58500 


tttgagtctc ctggaattag 


tatcaattca 


58560 


tggttatttc cttttcccca 


atgctcagta 


58620 


gaaggctgtg agaaagatca acagtacaaa 


58680 


ggagacctgg tctccccttt 


cattcaagga 


58740 


ggtctgtaaa ctggctcagg 


tctgggaatg 


58800 


gattcaggtt aggctttcat 


tcaatgacct 


58860 


taatttaaaa ggatttctgt 


ctattttacc 


58920 


cccacaccat aggtctgatt 


caaatattaa 


58980 


ccaaaataat gttttaatta 


agtatttgag 


59040 


aagttaacca tcgtattttg 


tttatactca 


59100 


ttagtcgaaa tcacctgcac 


agatcagagg 


59160 


agtactcatt gagatatagc 


attagcagga 


59220 


cagagatctt ctctggaaaa 


taataatcta 


59280 


atgcctagaa aggcatgcag gcccatcaca 


59340 


acacagacac aaagagtcct 


gaggaaagtg 


59400 


atcagttcac aaggtactct 


ttggacattt 


59460 


cactaacaca tacacacata 


cacctttctt 


59520 


tttttacttc tctgacctac 


cctaggattc 


59580 


gttttatgca tcactggaga 


aagctttagt 


59640 


gagtacatgg aatctaaagg 


taaagcaact 


59700 


agtgggatac tttgggaaag 


tttcttaaaa 


59760 


acagggttgt caagaagatt ggataattaa 


59820 


ggagtagatg ctcaattttg 


ttccataata 


59880 


tcggtattac ttcttatggc 


ataaacttga 


59940 


tattctgaat tatacaaatt 


atctccctgc 


60000 


catttgttga agttctagct 


cacagtgtga 


60060 


gtttagatga ggatatgaga gtgaatccct 


60120 


gacaccaggc ctctgtctcc 


tgctcttttt 


60180 


atgcctgtgt acacacaaag gaaagatgat 


60240 
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gtgaggacag aaataataca gtcatctgca 
agttagctga cactttgatc ttggaattcg 
5 tgctgtttaa atgaagcagt catggtaatt 
accttttctt cttgcaaatt gggaaaggga 
atgaaacaaa gaaacttttt gttctttaaa 

10 

tatctaatac ctatgtgtaa gtccgaaatt 
agtaggaaat ccctgttgct gcctttgtag 
15 ttaaagtatg ctatatatgc aagactgtgc 
aatgaaaacc agcaggacag tgcagtgaga 
atgcagagaa aactaagaat tcaatagatg 

20 

cgatagaagc cccagtggtt ccccttccct 
tggagacatt aagagtgaca ctgggtttta 
25 ctattattat ttataaactt taaaaacaaa 
cgggtgacat ctggccacaa cagttgactg 
caaagtatgg atatggatct ttttaaagga 

30 

acttaaaaac aaagaaaatt gactttcagc 
cagtgttttt cttggaactt tccagcaagg 
35 taatcccatt gactatggtt cagagaactt 
aatacttagc aataagaagc ttgaaattaa 
acatattata tattattcca taaaccacat 

40 

aggttttggc agattggaaa ataagaagaa 
tgaacagata catgtcatca atgtataaag 
45 tggagttcca cagccaccct gaaatggaga 
ggcacggtgg ctcatgcctg taatcccagc 
aggtcaggag atcgagacca atctggctaa 

50 

caaaaaaatt agccaggcat ggtggcgggc 
ggcaggggaa tggcgcaaac ccaggaggca 
55 gcactccagc ctgggcaaca gagcaagact 
tgcactgatt cagtgtgaca tttccagtat 
ctataaagaa tcaacaaaaa atgaactttg 

60 

tatctgctca ccatgctaaa aatggcaata 
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agccaagaag agagctctca taaggaacca 


60300 


cagcctccaa aagtatgaga 


aataaatttg 


60360 


taccatagca ccctaagcag 


aataatatac 


60420 


gtgaaggcat aaaaatccca 


tctgaccaca 


60480 


cataaaattc ctaggtagtc 


acaaagggaa 


60540 


agtaattaga gtgtttatgt 


cggaagatac 


60600 


gttctgtaga atggtcttct 


aatcttattt 


60660 


tactactacc ttagaggagt 


gaagaatcag 


60720 


aaaacagtca tcacttacag 


aagcccagac 


60780 


cagaaaatca gagcttgaga 


ccactccaca 


60840 


cgtattcgct tgctccttcc 


aagaaatgtc 


60900 


ccattattca tggaataaga 


tttaaagaca 


60960 


agaaaactag aggatttagg gactcctaag 


61020 


ctctaattct gctcccagag 


ttgttctctc 


61080 


gggctttaac aacagagaaa 


aataagagaa 


61140 


tgcagtaaga gaagaacgat agaaagaata 


61200 


gaatcagacg agaggctgtg 


aggaaaggcc 


61260 


ggattcctca gatcatcata 


acttctctgg 


61320 


ggctgaattt cagaacatta 


atgaatacaa 


61380 


taagactaga gtaatttgca 


ttaaaagagt 


61440 


atatgaggat atggaaaatt 


tttgaagtag 


61500 


gggcctaggg catttattag caggaggcca 


61560 


aattgtaaaa taacgtctgt 


actggggccg 


61620 


actttgggag gctgaggcag gcagatcaca 


61680 


cacggtgaaa ccctgtctct 


actaaaaata 


61740 


acctgtaatc ccagctactc 


aggaggctga 


61800 


gaactttcag tgagccgaga 


tcgcaccact 


61860 


ccatctcaaa aaataaaata 


aaataatgtc 


61920 


tatagctaag ctcatagaaa 


cacatttaac 


61980 


gaaagagtgc aaacataaag 


caagacaaag 


62040 


ataagagaaa taatacttat 


tactcttatt 


62100 
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tttattacta ctacaattta cttagcatct 
acaataactg tataacaatt tttattctca 

5 

taaattaata gataacttag cctagacccc 
attctaggtt tctcttctcc aaagagtaag 
10 taaagaataa taaaaaacaa agaacaataa 
agatagctat tataacactg tgctagtcat 
gggaaggtat agagcaaagt taaaacctga 

15 

gtatgttgca acatagtgtg cttaacttag 
atgaaggata atcaccatta tgattcaacg 
20 gttcagagaa aagaaattgt gagggagggg 
ggtaagtggc tagaggctgt cacatcatga 
ggaaacacca gaaaatgaca aggtacagtt 

25 

tctttcttgt cctttcaaga tgaaagtaca 
ttttcttacc cctcgaatct gggttgtcta 
30 acacatgact cagcagaggt ttgaaaagta 
ccaactgcca gagacatgac tgaggccatc 
tgaccacagt tgcatgagac agcctggtag 

35 

accacacaca ggacccacag atgtgtaaac 
acattttgag ggatttgtta cacagctaaa 
40 gtgacaggaa aaatggatgt tatctatact 
gagagatttt gcgttctacc tgtgtctgaa 
ttgccatttg gtaactaaaa tgtccttttt 

45 

gctggagtgc agtggtgcaa tttcagctca 
ttctcatgcc tcagcctccc gagtagctgg 
50 aattttttat atttttagta gagatggggt 
tcctgacctc aggtgatccg cccgcctcag 
gccaccatgc gcggtcccta aaatgtcttt 

55 

atgtatatat gaggaatcag aaaccaggag 
aaactcctaa ctaaaccagg actacttgtc 
60 tacttatagc catcttggga tcttgagcag 
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actacactcc 


aggtacttta 


aaatttactc 


62160 


taaaggaaat 


gtaatatttt 


aataaatatg 


62220 


ataaatagta 


aataaaaggg 


cttggggttc 


62280 


tcttccactg 


catcaccaga 


ttgcatgaat 


62340 


aaaattaaag 


ttatatttct 


gccatttgtc 


62400 


catgcttaga 


gggagaaaaa 


acaaaaactg 


62460 


gtaattaggg 


tgaaatggtg 


ttttataaaa 


62520 


ggaatgctaa 


cacaacatca 


gatgatgttc 


62580 


tccatagaaa 


actgaacaag 


agatggatac 


62640 


tggcctttta 


aaatattcta 


actggcaact 


62700 


ggttcaatta 


aggaagacct 


gatctcaata 


62760 


aaaaacagac 


acaaattgtt 


cccttccctt 


62820 


gatcctccca 


tgaagacatt 


agggggccta 


62880 


tgaaatttga 


ttggccaata 


tgacattaac 


62940 


tttgtgaatt 


ggaagagaac 


caagagtctg 


63000 


ctaattcatg 


taattggaca 


ctccaccagg 


63060 


agaagtgctg 


agcagcctta 


gatgagacag 


63120 


aaaagccgat 


ggtggttgct 


ggaactaact 


63180 


gctacttaaa 


tcaggtggca 


aaaccacaaa 


63240 


tcaagaagtg 


gttaatttgt 


tccgtggcct 


63300 


gatatggaac 


tggaataaat 


gtgtatctag 


63360 


tttgagacag 


agtcttgctc 


tgttgcccag 


63420 


aggaaacctc 


cggctcctgg 


gttcaagtga 


63480 


gactacaggc 


acgtgccacc 


acacccggct 


63540 


ttcaccatgt 


tggccaggct 


ggtctctaac 


63600 


cctcccaaag 


tgctgggatt 


acaggcgcga 


63660 


atatgtagat 


catctagtcc 


aatttggtca 


63720 


agggggatgt 


gacttgccaa 


aagtcacagt 


63780 


aggttctctg 


atccccagtc 


ccaggcattt 


63840 


tcctgagatt 


ctatgactct 


gtgactctat 


63900 
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ggttctataa ttccaagaca aagcaaacaa 
aaaagaacat ggtctagact gaagtaccaa 
5 caccatcatg gattcaagca ccgcacacag 
cactgcttca gaatatgagt ccacagaact 
cttgcaaaaa ttatttgcta gaaaaatgaa 

10 

tgtatatttt actgaggcag gggaaaggct 
ggtaggagcc tattccaatt cttctttaag 
15 tttgagagag gatgtttgag aagaacgaat 
atcatttagg actcacctac ttaaaaagtg 
cagactgaga aggtggactg agaagaagag 

20 

atattgtata ccttccattt ttgtatattg 
accactgctt ggtgtcacag tgcatcacat 
25 aaaaacccga agagatggag agtcctctgg 
tcagagaatt ggccatctcc aggcagggct 
ctggtatact catttagaat tcagtgtttc 

30 

tgaagaactt atgaagacac aggtttccca 
ggatagggcc tgatagttcg catttctagc 
35 ggaccacatt ttcaggagca ctgatttaga 
aggaattaag aaagtaccca ggagcatcat 
accaaagaga gtaattgcca aatccttttg 

40 

aactatcact ttgcttttcc tcaccatgac 
ctgtttggaa ttatgacctt ttcttttgga 
45 tatccaaggt ttccctttat atttctttca 
agtatagtca atcaagttca atttgaagcc 
caatgaaata agctagatcc agttgtatga 

50 

tgttgaaatt atttccccat tagttcatca 
aagttccttt gggaatactt cttttgaggc 
55 ttgaaattag ttactattta ttataagcca 
cccaaagacc tctcagtcta gcaagaaaac 
taaatgttat cacagaggtg caggaaatgc 

60 

ttctttacag aagtgaagtg ggaagggttt 
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ttccagtgct 


ccaggcagcc 


tcagcacaag 


63960 


ctaaatcatc 


tcctttcaaa 


ttatcaccga 


64020 


tccggtgttt 


ctggtatttc 


ctccagaaat 


64080 


ttcagccacg 


accttttcaa 


ctcaaagccc 


64140 


aatcttaggg 


gtaagtaaga 


cttgccccta 


64200 


agggaaatta 


tctgttggca 


catgtttgaa 


64260 


acaatgtgaa 


tatagagtgt 


tggtgggcca 


64320 


catgtgggtc 


tggacttgac 


aaaaaatgct 


64380 


atagttgcta 


tggcagtaaa 


tgagattact 


64440 


agcagagcac 


atagactcac 


acgtttttgt 


64500 


tcctcattag 


tctttcccag 


cacttgtcaa 


64560 


gagtggctca 


gagagggaga 


gaaaaaaaaa 


64620 


gcctgagagt gtgacaatac ctcaagaggc 


64680 


cctcacagtc 


caattctgct 


tgaggccagc 


64740 


tcagacattc 


acatttatgg gctaaagaac 


64800 


gcctcacctg gagaccgttc 


agtaagtatg 


64860 


tcccaagtga 


tcctgatgtt gccaatccat 


64920 


tcattttacc 


tatcaccaaa 


taccagatag 


64980 


aaagggactt 


ctattattct 


tcccctaatt 


6-5040 


accaaaagaa gatattctaa 


acagaaggga 


65100 


tttttcctct 


atttgcagac 


tatccagatc 


65160 


gttatcttcc 


ttttcacttt gttaaaacca 


65220 


ggatatccat tctggggctc tgttttggtg 


65280 


atgccaacca 


ggatgttagg gaattcttca 


65340 


catgctttca 


aaagtgcaaa 


aaactaacct 


65400 


cagtcactga 


tattccatca 


ttaggtcttg 


65460 


ttgcagttgt 


ttatttggct 


attcattcat 


65520 


gatacacaga gaaaggcaat gggcctctcc 


65580 


atgtaaacag gtcacttcaa gtcagtgtga 


65640 


ttatgtaagc 


agagatgaag gagcaattta 


65700 


ctcagatata 


ccttttgagc 


tgcggagagg 


65760 
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tatttattaa acccagtgta gtcacaatca ataaacatta acttaggacc taatttagtg 65820 

ccagagaatg ggcagagcaa tggcttcagg gaaaagaaag cagtgcctga gctcaaggtc 65880 

5 

tcagttgagt gaggagtcag acaattaaca cagggtgtga tgagggagaa taaacacagg 65940 

ggctgcctcc ataggaaagg catcaacaaa acagagggga caacgcccag agacatttcc 66000 

10 agcttgacca agttttaaag ggtgaatggt taatcaggtg aggacacaca cacagaagat 66060 

gggagttggg atacccaagt gtaaggatta gggatgtctt gttgcatgcc aagggagaaa 6612 0 

tatgagcaca ggctgaaggt atgggagagc attttctatt atgtgaacta taagagttca 66180 

gcacagggag tgaggatgga gagatgaggc tggaaaatac ccatgaaggg gctggaaatt 66240 

taaacatgga agtgctgtaa tcagatttga aatttccact agacaacaat gctaccacgt 66300 

20 gtgacccact tccaagaaaa atcttggaga ttttaagaac taccagcaca ttcactgcaa 66360 

ccccacctta tgttcccatc caagagaaca cactggattg cagtaagatg gaatccaatc 6642 0 

ttaatgatta taaactgact tgaaatccct ttgtcatgag atcaaagctg agtccacaat 664 80 

agctataaaa agaacatgga tgagtaaacc aaagccctgt acggttaaaa aaaatgttca 6654 0 

tgttcaaata aatagttagg gctaaaggag atttccagac cagtgttttg tccactattc 66600 

30 catttgtgga aagtcatata aagcaactca cccttctcac tgccctgagg agtgtgtgaa 66660 

cttagaaggc gggcctgtaa aagaaatgca tagaggtcta ttctgtaaga actcaacatc 66720 

agctaaacat ggtcatcatt aacatattta ttcctcttaa cagttcatta attctggagc 66780 

cttcctaatt gcagtgaaaa gaaaaaccac agaaactctg gtgagttata ttcttacttt 66840 

attaaaaata tattttgtag ccagtcatgg tggctcatac ctgtaatccc agcactttgg 66900 

40 gaggccaaga cgggtggatc atgaggtcag gagttcaaga ccagcctggc caacatggtg 66960 

aaaccccatc tctactaaaa atacaaaatt agccaggtgt ggtggcacac acctgtagtc 67020 

ccagctactc aggaggctga ggcaggagaa ttgcttgtac ctgggaggtg gaggttgcaa 67080 

ggagccgaga tcgtgcctgg gcaacggaaa agactccatc tcaaaaaaaa aaaacaaaaa 67140 

aaagtaatga gttacatttt cactatttcc actttattaa aaatatataa ctttcaaata 67200 

50 atttatcctg gccttgaaaa aaatttgtaa aatcattagg tcaccataat tccacaaact 67260 

atcaagtatt aataccaata aacagcattt gcttcctgtt ttcctagaat tataacccat 67320 

tatgctagag gagaaagttt ctctaggtca tctagtttag ctcttatttt atgtatgaag 67380 

acactgaggc ttacaatcgt taaatagctt gtccaaattc ccagccaatt aatggtagaa 67440 

ttatttcaac attccagtta cttattcatt cttttttgtt ttctgggtat ttacggcata 67500 

60 ccaggcactt tgacaggcac tactttgact catacgaaag gcaaactgta gacaatctaa 67560 



35 



45 



55 
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ttaaaccata aacagaatct caaaataaag 
ataaattata agtagaatat ttttacttta 
5 atgccaacaa aagattttag caaaggactg 
attaaaaatg gcagagaagt gcagcattca 
taaacagacc atgactttca ccagaaaatc 

10 

tcccaactga ataagagtta acttaggcat 
gttcacaaga tccccttatt gtgtagatga 
15 cgcacaggat cacagagctg cctcgtggca 
cattctccgc actcattgct ttgtttgtag 
ttcttattct atttcagata atattgagcc 

20 

caatagctgg aatcattctc ctcacatttg 
gttattctca ccaaaatagt cagtgtaagg 
25 tgcatttata gagtgatgag taaagggctt 
caccattcag atcatgttgt ggacacatgt 
tccataatat ttgatagaca ttacaccagg 

30 

aactcccacc tttaggaatt cacaggagca 
atacaaatat aaatctaaaa aaaaaaggta 
35 atgtacatta gaatcacagt ggtgtgtttt 
aacttccgat aaagcatgat aagattgaaa 
accactaaga ttagtgtaaa agaatgtaaa 

40 

gttaccaaac acacacaaaa attgctgagg 
agggggaagt catattagaa gatatttctt 
45 gaatgcacca ggtccacaga aagcaggact 
gtaatggtta agtatggcta tgtggattaa 
aaaaaacaaa acaaccttaa agacattaaa 

50 

caatttttag atgaaagcct gcaacctgaa 
agctacttat gccttcagac ccatttgcca 
55 ccatgtggac tgaaagagaa agaagtcaag 
aagagagatt ttctccgtat taaactaggt 
gggcaaaaag atatacaacg taccctgccc 

60 

aaaaggaaaa gtatttgaga agtgtggcca 
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aatacagtgc tactgaatgt aaagttatag 67620 

ctcaaagaat tgaagacata agatattact 67680 

aaaatcctta gtttgtcatt ttagggagaa 6774 0 

gttaaactta ggagtcttaa aataaaaatg 67800 

aaaatgcaat tgttacaggc acacccttcc 67860 

agaatatggg gaatggaaag gttactaaaa 67920 

aggcctgaag cccaaagatg tgaagcaatt 67980 

ttaagatgct gtgactaaaa tcctgctctc 68040 

tacaggctgg acctcagtca cattgttatg 68100 

gaataatgaa ttttcttagt gccctgggag 68160 

gtttcatcct agatcaaaac tacatttgtg 68220 

ctgttactgt cctgttcttg gtaagtatgt 68280 

aatagaaaat atttattagc actcaatcag 68340 

attttctttt actgacaaac atttatgaaa 68400 

cattgtggaa gagaaaaata aacacattat 68460 

caagagataa aacatacaca aatcagtatg 6852 0 

cagcagtatg gctgtagttt atagccctat 68580 

tacagttctt gatttggttt taacaggctg 68640 

aggggcattt attttagctg ccttcaaaat 68700 

aggtttgatc cacatggaga gaaaaagaga 68760 

aaattgaaaa cccaccatct aaggcaaagc 68 820 

acagcagagc ctaagaaata cttaatattt 68880 

aaagagcaga aaatagggag actgattttg 68940 

ctcttctctg aaaacagaag tctttttatt 69000 

aagctgaaag gtcagctgga actactagat 69060 

atttacaagt ggaactctga agagcatgaa 69120 

gtgatgctta tttggaatcc aatccctggc 69180 

gtttcacatc cactcaaagt ataaagactt 69240 

ctttgatagg ctactacgct cttcaggtaa 69300 

caaaatagca aagtcgctga acaaagttta 69360 

caagccaaac aaaccccttg acaacctcct 69420 
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tgcacgcccc aatccccact gacctttgct 
tacctgaatt tggtcttggt atgtggtgcc 

5 

ctatctagga gaaaaacatt cattaatctt 
attaacagta agcactcaaa agaggcctaa 
10 atagaaacag aacatagcaa aaaaaaaatc 
gaagagattt taaaaccaca atatttatca 
ccaagatggc caaataggaa cagctccggt 

15 

acgggtgatt tctgcatttc catctgaggt 
cagtgggcgc aggtcagtgg gtgcgtgcac 
20 gcctcacttg ggaagcgcaa ggggtcaggg 
acggacggca cctagaaaat cgggtcactc 
ttaaaaaacg gcgcaccacg agattatatc 

25 

cggagtctcg ctgattgcta gcacagcagt 
ctgggggagg ggcgcccgcc attgcccagg 
30 ctccaactgg gtggatccca cgacagctca 
ctctgggggc agggcacaga caaacaaaaa 
ccctgtctga cagctttgaa gagagcagtg 

35 

agactgcctc ctcaagtggg tccctgaccc 
ccccccagca ggggcacact gacacctcac 
40 gagggtcctc tctgttagaa ggaaaactaa 
catctgtaca tcaccatcat caaagaccaa 
aacagaacag aaaaactgga aactctaaaa 

45 

gcagttcctc accagcaacg gaacaaagct 
gaggcttcag acgatcaaat tactctgagc 
50 aggttgaaaa ctttgaaaaa aatttagaag 
agtgcttaaa ggagctgatg gagctgaaaa 
gaagcctcag gagtcgatgc gatcaactgg 

55 

tgaatgaaat gaagcgagaa gggaagttta 
gcctccaaga aatatgggac tatgtgaaaa 
60 aaagtgatgg ggagaatgga accaagttgg 



aggcataaac 


tggataaggc 


aagaatgttt 


69480 


cctggtacct 


ggcagaagca 


aaaccgaagt 


69540 


aataaattcc 


catcaaattt 


tccattgaaa 


69600 


aaagaaataa 


gacaatgcga 


ataacaacta 


69660 


acctgtaaat 


ttctgatact 


aagtatatta 


69720 


tgttttaaga 


aataaatgag 


gggggaggag 


69780 


ctacagctcc 


cagcgtgaga 


gacgcagaag 


69840 


acccggttca 


tctcactagg 


gagtgccaga 


69900 


cctgcgcgag 


ccgaagcagg 


gcgaggcatt 


69960 


agttcccttt 


ctgagtcaaa 


gaaaggggtg 


70020 


ccacccgaat 


actgcgcttt 


tccgacgggc 


70080 


ccgcacctgg 


ctcggagggt 


cctatgccca 


70140 


ctgagatcaa 


actgcaaggc 


ggcagcaagg 


70200 


cttgcttagg 


tatacaaagc 


agctgggaag 


70260 


aggaggcctg 


cctgcctctg 


taggctccac 


70320 


gacagcagta 


acctctgcag 


acttaaatgt 


70380 


gttctcccag 


ctggagatct 


gagaacgggc 


70440 


ctgacccccg 


agcagcctaa 


ctgggaggca 


70500 


acggcagagt 


actccaacag 


acctgcagct 


70560 


caaacagaaa 


ggacatccac 


accaaaaacc 


70620 


aagtagataa 


aaccacaaag 


atggggaaaa 


70680 


agcagagtgc 


cactcctcct 


ccaaaggaat 


70740 


ggatggagaa 


tgactttgac 


gagttgagag 


70800 


tatgggagga 


cattcaaacc 


aaaggcaaag 


70860 


aatgtataac 


tagaataacc 


aatacagaga 


70920 


ccaaggctcg 


agaactacgt 


gaagaacgca 


70980 


aagaaagggt 


atcagcgatg 


gaagatgaaa 


71040 


gagaaaaaag 


aataaaaaga 


aatgagcaaa 


71100 


gaccaactct 


acatctgatt 


ggtgtaccta 


71160 


aaaacactct 


gcaggatatt 


atccaggaga 


71220 
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acttccccaa tctagcaagg caggccaacg 
aaagatactc ctcgagaaaa gcaactccaa 
5 aaatgaagga aaaaatgtta agggcagcca 
agctcatcag actaacagtg gatctctcgg 
ggccaatatt caacattctg aaagaaaaga 

10 

aactaagctt cataagtgaa ggagaaataa 
attttgtcac caccaggcct gccttacaag 
15 ggaacaactg gtaccagccg ctgcaaaatc 
gaagaaactg catcaactaa cgagcaaaat 
attcacacat aacaatatta actttaaatg 

20 

acagactggc aaattggata aagagtcaag 
atctcatgtg cagagacaca cataggctca 
25 caaatggaaa acaaaaaaag gcaggggttg 
aatcaacaaa gatcaaaaga gacaaagaag 
aacaagaaga gctaactatc ctgaatatat 

30 

taaagcaagt cctgagtgac ctacaaagag 
actttaacac cccactgtca acattagaca 
35 cccaggaatt gaactcagct ctgcaccaag 
acgccaaatc aacagaatat acattttttt 
accacatatt gggaagtaaa gctctcctca 

40 

actatctctc agaccacagt gcaatcaaac 
actgctcaac tacatggaaa cagaacaacc 
45 aaatgaaggc agaaataaag atgttctttg 
agaatctctg ggatgcattc aaagcaggat 
acaagagaaa gcaggaaaga tccaaaattg 

50 

acaagcaaga gcaaacacat tcaaaagcta 
cagaactgaa ggaaatagag acacaaaaaa 
55 ggttttttga aaggatcaac aaaattgata 
gagagaagaa tcaaatagac gcaataaaaa 
cagaaataca aacgaccatc agagaatact 

60 

atctagaaga aatggataaa ttcctcgaca 
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ttcagattca 


ggaaatacag 


agaacgccac 


71280 


gacacataat 


tgtgagattc 


accaaagtgg 


71340 


qaqaqaaagq 


ttgggtaacc 


ctcaaaggga 


71400 


cagaaaccct 


acaagccaga 


agagagtggg 


71460 


attttcaacc 


cagaatttca 


tatccagcca 


71520 


aatactttac 


agacaagcaa 


atgctgagag 


71580 


agctcctgaa 


ggaagcacta 


aacatggaaa 


71640 


atgccaaaat 


gtaaagacca 


tcgagactag 


71700 


aaccagctaa 


catcataatg 


acaggatcaa 


71760 


taaatggact 


aaatgctcca 


attaatagac 


71820 


acccatcagt 


gtgctgtatt 


caggaaatgc 


71880 


aaataaaagg 


atqqaqqaaq 


ttctaccaag 


71940 


caatcctagt 


ctctgataaa 


acagacttta 


72000 


gccattacat 


aatggtaaag 


ggatcaattc 


72060 


atacacccaa 


tacaggagca 


ccaagattca 


72120 


acttaaactc 


ccacacatta 


ataatQQQaq 


72180 


gatcaacgag 


acagaaagtc 


aacaaggata 


72240 


cagacctaat 


agacatctac 


agaactctcc 


72300 


cagcaccaca 


ccacacctat 


tccaaaattg 


72360 


gcaaatgtaa aagaacagaa 


attataacaa 


72420 


tagaactcag gattaagaat 


ctcactcaaa 


72480 


tgctcctgaa 


tgactactgg 


gtacataatg 


72540 


aaaccaatga 


gaacaaagac 


acaacatatc 


72600 


gtagagggaa 


atttaaagca 


ctaaatgccc 


72660 


acaccctaac 


atcacaatta 


aaagaactag 


72720 


gcagaagcca agaaataact 


aaaatcagag 


72780 


ctcttcaaaa 


aattaatgaa 


tccaggagct 


72840 


gactgctagc 


aagactaata 


aagaaaaaaa 


72900 


atgataaagg ggatgtcacc 


accgatccca 


72960 


acaaacacct 


ctacgcaaat 


aaactagaaa 


73020 


catacactgt 


cccaagacta 


aaccaggaag 


73080 
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aagttgaatc tctgaataga ccaataacag 
taccaaccaa aaagagtcca ggaccagatg 

5 

aggaggaact ggtaccattc cttctgaaac 
tccctaactc attttatgag gccagcatca 
10 ccaaaaaaga gaattttaga ccaatatcct 
aaatactggc aaactgaatc cagcagcaca 
gcttcatccc tgggatgcaa ggctggttca 

15 

atataaacag agccaaagac aaaaaccaca 
ttgacaaaat tcaacaaccc ttcatgctaa 
20 cgtatttcaa aataataaga gctatctatg 
ggcaaaaact ggaagcattc cctttgaaaa 
cactcctatt caacatagtg ttggaagttc 

25 

taaagggtat tcaattagga aaagaggaag 
ttgtatatct agaaaacccc attgtctcag 
30 tcagcaaagt ctcaggatac aaaatcaatg 
acaacagaca aacagagagc caaatcatga 
gaataaaata cctaggaatc caacttacaa 

35 

acaaaccact gctcaaggaa ataaaagagg 
catgggtagg aagaatcaat atcgtgaaaa 
40 tcaatgccat ccccatcaag ctaccaatga 
taaagttcat atgggaccaa aaaagagccc 
acaaaactgg aggcatcaca ctacctgact 

45 

aaacagcatg gtactggtac caaaacagag 
cagaaataac gccgcatatc tacaactatc 
50 aatggggaaa ggattcccta tttaataaat 
gaaagctgaa actggatccc ttccttacac 
aagacttaaa cgttagacct aaaaccataa 

55 

ttcaggacat aggcatgggc aaggacttca 
aagacaaaat tgacaaatgg gatctaatta 
60 tcaccatcag agtgaacagg caagctacaa 



gagctgaaat tgtggcaata atcaatagct 73140 

gattcacagc cgaattatac cagaggtaca 73200 

tattccaatc aatagaaaaa gagggaatcc 73260 

ttctgatacc aaagccgggc agagacacaa 73320 

tgatgaacat tgatgcaaaa atcctcaata 73380 

tcaaaaagct tatccaccat gatcaagtgg 73440 

atatatacaa atcaataaat gtaatccagc 73500 

tgattatctc aatagatgca gaaaaagcct 73560 

aaactctcaa taaattaggt attgatagga 73620 

acaaacccac agccaatatc atactgaatg 73680 

ctggcacaag acagggatgc cctctctcac 73740 

tggccagggc aattaggcag gagaaggaca 73 800 

tcaaattgtc cctgtttgca gatgacatga 73860 

cccaaaatct ccttaagctg atcagcaact 73920 

tacaaaaatc acaagcattc ttatacacca 73980 

gtgaactccc attcacaatt gcttcaaaga 74040 

gggatgtgaa ggacctcttc aaggagaact 74100 

atacaaacaa atggaagaac attccatgct 74160 

tggccatact gcccaaggta atttacagat 74220 

ctttcttcac agaattggaa aaaactattt 74280 

acatcgccaa gtcaatccga agccaaaaga 74340 

tcaaactata ctacaaggct acagtcacca 74400 

atatagatca atggaacaga acagagccct 74460 

tgatctttga caaacctgag aaaaacaagc 7452 0 

ggtgctggga aaactggcta gccatatgta 74 580 

cttatacaaa aatcaattca agatggatta 74640 

aaaccctaga agaaaaccta ggcaatacca 74700 

tgtctaaaac accaaaagca atggcaacaa 74760 

aactaaagag cttctgcaca gcaaaagaaa 74 82 0 

aatgggagaa aatttttgca acctactcat 74880 
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20 



ctgacaaagg 


gctaatatct 


agaatctaca 


caaacaaccc 


catcaaaaag 


tgggtgaagg 


tttatgcagc 


caaaaaacac 


atgaaaaaat 


aaatcaaaac 


cacaatgaga 


taccatctca 


caggaaacaa 


caggtgctgg 


agaggatgtg 


gggactgtaa 


actagttcaa 


ccattgtgga 


aactggaaat 


accatttgac 


ccagccatcc 


aatcatgctg 


ctataaagac 


acacgcacac 


caaagacttg 


gaaccaaccc 


aaaug Lccaa 


atatatacac 


catggaatac 


tatgcagcca 


ggacatggat 


gaaattggaa 


atcatcattc 


aacaccgcat 


attctcactc 


ataggtggga 



25 aggggaatat cacactctgg gaactgttat 
cgggagatat acctaatgct agatgacgag 
tgtatacata tgtaactaac ctgcacaatg 

30 

taaaaaaaaa gaaaaagaaa aaggccatta 
aagccaatag ttttccaagc tgacttagaa 
35 tatttgtcag cataacctac cttctcttat 
taaaaatgtc aagacttgaa aaaaaaaaag 
ggaatataaa ataaaagaaa gtgaactaga 

40 

ataaaaatag aaaattaaaa aatttaaaat 
gtttaaaaaa caaaacacaa ctgcattctg 
45 caaatagatt taaggcaaga agcattgcta 
agggtcgatt cacaagagaa atattacatt 
aaagccaaaa ggaaaagcta actcagctga 

50 

aaattttagc acacatttct cagcaactga 
gaattttgaa tacaattgat aaaattgacc 
55 caaataacta caattataat actgtcaact 
gtatttatca tgtataacat attttgaagt 
agctaattaa caaaggcatt acctcccata 

60 

attcactatc tttgcacttt taagaataca 
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atgaactcaa 


acaaatttac 


aagaaaaaaa 


74940 


acatgaacag 


acacttctta 


aaagaagaca 


75000 


gctcaccatc 


actggccatc 


agagaaatgc 


75060 


caccagttag 


aatggcaatc 


attaaaaagt 


75120 


gagaaatagg 


aacactttta 


cactgttggt 


75180 


agtcagtgtg 


gcaattcctc 


agggatctag 


75240 


cattactggg 


tatataccca 


aaggactata 


75300 


gtatgtttat 


tgcggcatta 


ttcacaatag 


75360 


caatgatagg 


ctggattaag 


aaaatgtggc 


75420 


taaaaaatga 


tgagttcatg 


tcctttgtag 


75480 


tcagtaatct 


atcgcaagaa 


caaaaaacca 


75540 


attgaacaat 


gagatcacat 


ggacacagga 


75600 


ggggtggagg 


gaggggggga 


gggatagcat 


75660 


ttagtgggtg 


cagggcacca 


gcatggcaca 


75720 


tgcacatgta 


ccctaaaact 


taaagtataa 


75780 


aaaaaaaaaa 


gaaaaaaaaa 


gaaaggagat 


75840 


actcgacaaa 


aaaaatgggt 


atatccttac 


75900 


tatctgtcag 


tctgaaatac 


tcctcaataa 


75960 


aaagaaatga 


taaactttaa 


aatacctcag 


76020 


agattttctg 


tgtaaaacta 


ttatgtaccc 


76080 


tttaaaaaaa 


gatggtccca 


gtaaataaga 


76140 


tttacaagaa 


acagctacac 


tagtaacaaa 


76200 


aacgtataat 


aggggacttt 


ctataataaa 


76260 


tctaggtttg 


aatgctccta 


ataacaactc 


76320 


atgagtagag 


aaagctaaaa 


tcatcataag 


76380 


aagaatatgt 


agacagacca 


taaaattaca 


76440 


taaatgtcat 


atatagaatg 


aacactacac 


76500 


aarathttta 


a t* t~ era c aaa t 


aaaattatat 


76560 


acatatatgc 


attgtggaat 


gtttacatct 


76620 


gttatcactt 


ttgcagtgag 


aacacttaac 


76680 


atagccatca 


ttctcagcaa 


actaagacag 


76740 
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gaacagaaaa ccaaacacct tatgttctca 
catggacaca gggaagggaa tatcacacac 

5 

ggagggagag ctttagaaca aatacctaat 
ttgataagtg cagcaaacca ccattgcaca 
10 catgtatccc agaacttaaa gtaaaataat 
aactatgggc accccgctgt acaacagatc 
tgatggaagt gataagacaa tagaattaca 

15 

acccagaatt ctaggcctgg tgaaaatgtt 
cagtggctca ggcctgtaat cccaggactt 
20 gtcaggagtt tgagaccagg ctggccaaca 
aaaattagca gggcatggtg gtgggcagct 
ggagaatcac ttgaacctgg gaggcggaag 

25 

tccagcctgg gcagcagagc aagactccgt 
atgaaagtga aataaaaaca actgaaaaaa 
30 aattagccat gtgctgagcc tacaagcctc 
agtatattgt ctgaaaacaa tggaattgaa 
aaatccccaa aactgttttg aaattctaag 

35 

cataaaaaaa ttaaaaacaa aataatagaa 
gtgcttagag gaaactttac gacctcaaag 
40 agtaaattat ctaaatattc ataccaagaa 
tgaaagaatt gagatatagt attctataat 
atgaa^taca acacaaacca tggataagat 

45 

aactaataaa attggtaaac ccctagatat 
tcattatcag aaatgaaggt gtatcactat 
50 ttattaacat aaaaattatt tgatgcatta 
caaaactatg caaaaccacc cgccatgact 
ggcttatatc taataaaaaa tcaaatacat 

55 

gcaaagcctt gcaggagcag actaatttct 
actgaatact ttcagttcac ttcccaactc 
60 tgacatgaaa acataaggac attataaaat 



ctcataagtg agagttgaac aatgagaaga 76800 

caaggcctgt tggggggtag tgggacaagg 76860 

gcgtgtgggg cttaaaacct cgatgacagg 76920 

tgtatacctg tgtaacctgc acgtcctgca 76980 

aataaaaaaa gaatacaata tactgtcatt 77040 

tcttgaactt attcctccta tctaactaac 77100 

tctttaaagt gctcaaacaa aataatgcca 77160 

cttcaaaaat gaaagtgaaa tagccaggcg 77220 

tgggaggcca aggtgggcag atcacttgag 77280 

tggtgaaaca ccatctctac taaaaataca 77340 

ttaatcccag ctactcggga agctgaggca 77400 

ttgaagtgag ccgggactgc accactgcac 77460 

ctcaaaaaca acaacaacaa caaaaacaca 7752 0 

ttgacagatt tcacttagaa cattttcaag 77580 

aacaaaattc aaagaattga aatcatatac 77640 

ttaaatgtca attttaaaaa ggcaaaaaaa 7 7700 

ttatacaagg ctcaaagaat ttacaacaga 77760 

atatttatat aaattgatta caactaaagt 77 82 0 

gcaagtattg ggaaggaaaa tagactaaaa 77880 

attagattaa aaaattaaat ttaaagaaaa 77940 

agacataaca aagaatagag ataaaaacta 78000 

aaaataaagc taaaatatgg atgtttgaaa 78060 

ttaaaaagaa aaaagagaaa acacaaacta 7 812 0 

caatacaata gccaagaaca tatgaggaaa 78180 

gatgattgtt tccttataaa acctacctac 78240 

aacacaggaa aaaatgtata tatctgattg 78300 

aatttaaaac tttctctcaa aaaagaaaaa 78360 

tccaaacatt tagaaaagaa acaacaccaa 78420 

acgagtattc atgtgaggct gtcacaagtt 78480 

atttaaaaat tacgggtaac tctatcttgt 78540 
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aaactttgtg aagaaatctg aaataaaata 
ataatatacc gtggccaagt cagtggatat 
5 catcaatcag tgatacacct aaatgtgaaa 
tgcagggaac ttaggagaat atcttcacga 
aaacataaaa gggactaaat ataggggaaa 

10 

aaagggagaa aatttcctct ctctatatat 
aatatattaa gcactcctga aaatcaataa 
15 gcaagcagtt cacaaagaaa gatagataaa 
cattattaat catcagataa atgcaaatta 
ccagaatcat tttaattaaa aggaccacca 

20 

ctggaactct catacccttt tgatggaaat 
aaaactattg ggtggttttt taaaaagtta 
25 agacaaaaat gtttatagca gtcctattca 
tctatcaaca acaaaatggc taattattca 
gaacaaccta ctgacacaga ggagcatgaa 

30 

aatttcagat aaaaaataaa caggcaaatt 
gttgactctc aatagggtgg gaggattgat 
35 tgataaaaat gtgatatatc ttgacgtgga 
tttttttttt gagatgaagt ctcactctgt 
cggcttactg caacctccat ctcccaggtt 

40 

tagctgagat tacaggctca tgctgccatg 
tggggtttca gcatgttgcc caggctggtc 
45 cctcggcctc ccaaagtgct aggattacag 
cttctttata tataaacttt gatatagtta 
acagcaacat tttcatgtgt ttatatacag 

50 

aaattttctt tttaaaagat aaagaatatg 
tggccaaaca tacaataatg ctccaattca 
55 acaatgaact aatatttaca tcactagact 
tggtatttat cactggtgaa aacaatgttc 
cactgccatt ttgggaagca aggatacaac 

60 

ttaacccaga tgtctatctc tacagtaatt 
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ttaacaaaat 


caaattcagc 


ataatatatg 


78600 


cagaagtaca 


caattgattt 


cacattcaaa 


78660 


tcctaaacta 


taaagcttga 


gtgggtgggc 


78720 


ccatggtagt 


agacaaagct 


ttcttagata 


78780 


aataaaataa 


aatggactac 


caaagcacag 


78840 


atctctctct 


cacaaaagac 


ccacatccag 


78900 


taaaaaggcc 


aaaaaaagat 


taagacttga 


78960 


tggctagtga 


gcatatgaga 


acgtgctcaa 


79020 


aaatcacaat 


aagacactac 


tccacacaca 


79080 


ataccgggtg 


ttggaaggaa 


tgtgaagcaa 


79140 


gggaaacgga 


aaatggttca 


accaccttgg 


79200 


atcatacacc 


tactttatgt 


ctcatgatat 


79260 


ttatagttaa 


actgataagg 


aactcaaatt 


79320 


aataatagaa 


tacttctcag 


caatataaaa 


79380 


tgaatttcaa 


tttattatac 


cgaggcaaac 


79440 


taacacatga 


tgctagaaat 


cagaagggtg 


79500 


tgggaaggga 


cacaagagaa 


cctactgggg 


79560 


aagaaaaaga 


gataaaagac 


atttcttctt 


79620 


tgcccaggct 


ggtgtgcagt 


gatgtcatct 


79680 


caagtgattc 


tcctgtctca 


gcctcccgag 


79740 


cccagctaat 


tttttgtatt 


ttagtaaaga 


79800 


tcaaactcct 


gagctcaggc 


aatccaccag 


79860 


gcatgagctg 


gcgcaaccag 


ccaaaataca 


79920 


aaaatggaaa 


atatagtttt 


aaagcagata 


79980 


agtgcatata 


actgacaaga 


agtcaagaaa 


80040 


aggaaagaat 


ttacaaaaag 


gaaactacaa 


80100 


ctaatagtca 


gggaaatgtc 


aattaattta 


80160 


ggcaaaaatt 


aatgagttat 


aatttatggc 


80220 


ttgtatatca 


ggactagaag 


tgtgcaatgt 


80280 


atttacctac 


ctgaatgata 


aacagacctt 


.80340 


aaaccacctg 


tatataatgg 


tatatgtaca 


80400 
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acaatgctat tgcagcattg ttcagagggg 
aaagaggaaa tcaatcagtt cctgtcacaa 

5 

atgaaatatt aagtggccat taaaaaccat 
ttaaaatata gtagacaaat aaggcttcag 
10 acctaagtat aaagtaaagc tctcagaaga 
gcagcctctg ggctactgac aagatgtgta 
caaaatattt tgaaggtctt ttgctgtgat 

15 

gtggctcatg cctgtaatcc cagcactttg 
caggagttca gcgctgccaa catggcgaaa 
20 ccgggtttgg tggtgggcac ctgtaatccc 
gcttgaacct gggagttgga ggttgcagtg 
ggctgataag agtgaaactc cgtctcaaaa 

25 

aacggtgaac tactagcgta tgcttcttat 
atagttttct tttttccctc tttttctttt 
30 ccttctttct ttccccttca ttagaccacc 
catttataaa atggactatt tcctacatat 
tctctaaaaa gataatatgc tatgctatgc 

35 

tatgctatcc tatgctatgc tagctatgct 
tgctatgcta tgctatgcta tgctatccta 
40 tcctatgcta tgctatccta tgctatgcta 
tactatgcta tgctatccta tggtatccta 
tgctatgcta tgctatgcta tgctatgcta 

45 

tgcaatgcag tgctatgcta tcctatgctg 
tgctatgcta tgctatccta ttctatgcta 
50 ctgaaaatcc ttcactgatt tgattcctaa 
gcttttatta tctgggcatc actatgtcat 
atcatgactc ttcatttttt tttcttcttc 

55 

tgactttcag cattattgaa ttattcattt 
cagaggattg tgattgtgaa caatgttgtt 
60 aaatattatg tagcaatgac ttctgcttgt 



taaaacaaac 


aaacaaaaaa 


acacagaaaa 


80460 


aagaataaat 


gaatgatgac 


aaactacacc 


80520 


gtggctctaa 


aatagctctt 


caatctatag 


80580 


gagaggatgt 


taaccatatt 


aacctttgca 


80640 


ctgcaaactt 


taactggtaa 


gtgcaggtct 


80700 


tctctatttg 


tgctcatagt 


tgccaaaata 


80760 


tgaaaaaaac 


acaggtgggg 


ccgggccatg 


80820 


ggaggccaag 


gtgggcagat 


cacctgaagt 


80880 


ccccgtctct 


actaaaaaca 


caaaaactag 


80940 


agctactcgg 


gaggctgagg 


caggagaatc 


81000 


agccgagatc 


ctgccattgt 


actccagcct 


81060 


aaaaaaaaaa 


gacaaaaaag 


aaaccaaaaa 


81120 


gtctctttcg 


ctttctcttt 


tactaagcca 


81180 


ctatttttag 


ttctttttcc 


tgtttcttca 


81240 


ttttatatat 


gcatttcttt 


gcattttatg 


81300 


ataaatcaaa 


aatgtatttt 


tatatgcctt 


81360 


tatgctatgc 


tatcccatgc 


tatgctatgc 


81420 


atgctatgtt 


agctatccta 


tgctatccta 


81480 


tgctatccta 


tgttatgcta 


tgctatgcta 


81540 


tgctatccta 


tgctatgcta 


tgctatccta 


81600 


tgctatgcta 


tgctatgcta 


tcctatgctg 


81660 


tgctatgcta 


tgctatgcta 


tgctatgcta 


81720 


tgctgtacta 


tgctatgcta 


tcctaggcta 


81780 


tgctatccta 


tgcttaaata 


gatgcatact 


81840 


gcatcagata 


cacttgagat 


atttggggaa 


81900 


tataatcagg 


aaaaccgtta 


ctgtgccaac 


81960 


ttctattaca 


gggaattttg 


attacattga 


82020 


ctctgccttt 


ctcaattttg 


gggtgccact 


82080 


gactagcact 


gtgagaataa 


agatgtgtta 


82140 


aagggaaagc 


caaaggacac 


atagaatctt 


82200 
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aaggctggga actctttgaa gcaaagggga 
tttgactgat aaagcacatt ctgcctcttg 
5 aatggtcacc ttggtgttgt cacctctgtt 
aatgtggtgt tgcaaagata ggaaccaaag 
cttctctggt tgatgacatc tcctgttgat 

10 

actccagtgg tgtgtggaaa gatgagctgg 
cattaatgct gcaagatcat caatgttaaa 
15 aaaaataatg aaggcaaata ttggtggaac 
attatatagg tgttactcat ttaatctcac 
tacccatttt acagaaaagg aaactgaagc 

20 

ataaagtttg caatggcaga gccatagtat 
cttaattcca atgccaagca aagtttggaa 
25 gcttttgcat tcaatctgtt atgatatctc 
taaacttgtg aaaagagaaa taaggtctta 
agggcaccta caaaggtctc aaggacccca 

30 

taccttatat ggtattgcct ctcactggct 
tgtatcatct cacttgtcct gacagccctc 
35 aaaattgaag ctcaaagtgg ttagcttttc 
gaggatttga gtcaagttcc ctaacttcaa 
gaaaaaaaaa actcagcata ctcaaaaaat 

40 

ctggcagcag cttaggttat aggtgcattt 
aaataagtaa gttttttact tcagtgaagt 
45 aatgatgtac agatgtgtga ggcttgagat 
aagtcaacta ggtttaaatg ttgtcttgaa 
aaaagggaat gcttacacac ttctgggggg 

50 

agcagtttgg caatttctca aagaactcaa 
attattgggt acatacccaa aggaacataa 
55 tatgttcatt gcagcactat ttgcaatagc 
aacagcagac tagataaaga aaatgtggta 
agaaaagaat gagataatgt cctttgcagc 

60 

aagcaaacta acacagcaac tgaaaagaaa 
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gtggctcaag tgaaggggtt ctggggacat 82260 

gcaatgccat ctgcacctct ccccaaatca 82320 

agtcatgcat tatttcaact aaaggacagc 82380 

cccaaggcct ggggtgaccc agtcatctaa 8244 0 

ttcactaaga gactccaccc agtggctcct 82500 

ttggtggaat ggtttggccc ataaattcta 82560 

gagctgatca actagaatac taacaatagt 82620 

cactactata ttccaggcat cattctaagt 82680 

aaaagtccta ggaggtgaat tgtattatta 82740 

acagaggggg taagtaattt gctcaaggtc 82800 

caacaggcag ccggttccag aatccaaggt 82860 

ggaactactt tcctctggat tctcttagca 82920 

gtgttctgta gcctatggag tatactactg 82980 

gtattattac aaaaatagtt ttgaacttat 83040 

ggaaatccga gaccacactt tgagaaccac 83100 

gaatacctcc taagtattag aggtctcacc 83160 

agggtagata tgtcttttgt catagtcata 83220 

tgagagcaca aaaggagcat gatttagaaa 83280 

atcccattat cctatgttgt ttatggtata 83340 

gaagtaacag gctttcattt ctactatttg 83400 

gtgaaggtga cggatacaac atggcaggaa 83460 

taggttatga tgaaaaatac ttggcaactg 83520 

atatttgagg gtgcaaagtc ttgggctttt 83580 

gtcaaaaaat agtagatgca agttgtggag 8364 0 

aatgtaaatt agttctgcca ctgtaaaaaa 83700 

agcaaaatta ctattcaacc cagcaatccc 83760 

atcgttctac cataaacaca tatgcacgta 83820 

aaaaacgtgg aatcaaccta aatgcccatc 83880 

catatacacc atggaatact acatagccat 83 940 

aacatagatg aagctggagg ccagtattct 84000 

atactgcatg tgttctcact tataagtggg 84060 
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agctaaacat tgagtacata tggacacaaa 
gagggtggag gatgggagga gggtgaggat 

5 

tacctgggtg acaaagtaac ctgtatatca 
acacatctgc acatgtaccc tgaacctaaa 
10 aaaattatta gaaataaaga tagaagactt 
ttaaacagat cccagaacat gaatagtacc 
tattaagaat gttacataag tgtgacgaga 

15 

caaataacca acaaaaataa tgggatatag 
aaaaatacaa cttgatgaaa acttagtcca 
20 gaaacaaagg ttaatatatg aaaagcactc 
actctcctac cacaatggta tgagtgtaaa 
agtaaaggta aagatacaca tactttataa 

25 

caagagaaat ccttgtacat gtgcaacata 
tgcctgcaat acctcaaatg tccatggaca 
30 ctggaggacc acatattggt aaaaataaat 
tacacattta atgttgagta aagaaagcaa 
taaataatga ttagaaaaca caaaaactgc 

35 

gtctccctct catgctgagc cgaagctgga 
cctccctgcc tgattctcct gactcagcct 
40 gccacgcctg actggttttg gtggagacgg 
agcccctaac cgcaagtgat ccgccagcct 
gaatctcgtt cactcagtgc tcaatggtgc 

45 

ctcgctacaa cctccacctc ccagccgcct 
gcctctgccc ggctgccatc ccgtctggga 
50 cgtctgggat gtgaggagcc cctctgcctg 
ccgcccggac gccatcccat ctaggaagtg 
ctaggaagtg aggagcgtct ctgcccggcc 

55 

gccccgccga cccgtctggg atgtgaggag 
tggtgaggag catctctgcc cggccgcccc 
60 cagccgcccc gtctgagaag tgaggagcct 
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gaagggaaca acaggcactg gggcctactt 


84120 


caaaaaacta 


ctattgggta 


rtf-ai- rrr" t~ t" a t~ 


84180 


aacccccgtg acatgccact 


a at" a f" a 
taCCLataLa 


84240 


ataaaattta 


aaaaacaaag 


aagaaacgc l 


84300 


ctttacaaag 


tcaaggtagt 


a a anaat" t" tr 


84360 


agcaagaact gatacaaccc 


acutagccgt 


84420 


gcagctacaa 


atcaaagcct 


tttagcaaca 


84480 


aatatataaa 


atcctcctat 


acattaatag 


84540 


aagccatgca 


cagccatttc 


acaggagaca 


84600 


aatttcttaa 


gcaattgtgg 


aatgaaagga 


84660 


tgaagacaac 


cactttaaaa 


actattccct 


84720 


cccaacaatt 


ccactgctag 


ttaatatacg 


84780 


tgacacgtac 


gaatgtttcc 


agagcaggat 


84840 


gagaacagat 


caattgtacc 


atgttcccta 


84900 


tacttgaagc 


aaatcaacaa 


agatgaatta 


84960 


gtcacaaaag 


aacatcttat 


taattctacc 


85020 


tctctccctc 


tccctctccc 


tctccccgca 


85080 


ctgtactgct 


gccatctcgg 


cttactgcaa 


85140 


gccgagtgcc 


tgcgattgca 


ggctcgcgcc 


85200 


ggtttcgctg 


tgttggccag gccggtctcc 


85260 


cggcctcccg 


aggtgccggg 


attgcagacg 


85320 


ccaggctgga gtgcagtggc 


gtgatctcgg 


85380 


gccttggcct 


cccgaagtgc 


cgagattgca 


85440 


agtgaggagc 


gtctctgcct 


ggccgcccat 


85500 


gctgcccagt 


ctggaaagtg 


aggagcgtct 


85560 


aggagcacct 


cttcccagcc 


gccatcacat 


85620 


gcccatcgtc 


tgagatgtgg gagcgcctct 


85680 


cacctctgcc 


cggccgcgac 


cccgtctggg 


85740 


atctgagaag 


tgaggagccc 


ctccgcccgg 


85800 


ctccgcccgg cagccacccc gtctgggaag 


85860 
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tgaggagcgt ctcccggcag ccaccccgtc 
ccaggccagc cgccccatcc gggagggagg 
5 acccgtccgg gaggtgaggg gcgcctctgc 
cctctgcctg gcatcacccc gtctgggagg 
tgatgacgat ggcggttttg tggaatagaa 

10 

atcagattgt tgctgtgtct gtgtagaaag 
gtactaagaa aaattcttct gccttgggat 
15 tgtgctctct gaaacatgtg ctgtgtccac 
gatgtgcttt gttaaacaga tgcttgaagg 
cctaatctca agtacccagg gacaaaaaca 

20 

gaaagccaga gacctttgtt cacttgttta 
tgaccctgcc aaatccccct ctgtgagaaa 
25 gagactataa acataatttg agtgtttaga 
taaataaata aaaataaaat taaaaaaatg 
aaagctggag aaaataaata ctaaagtgtt 

30 

aattagagga ggatgtgaaa gcacagcgta 
ttgaggtgaa agagaaataa ccataaaagg 
35 attggtttgg agaacggaaa taaaatctac 
gcacacaggt tggagcagag aaagaggaaa 
cattgatacg ttcttgccta tctctcctcc 

40 

acctgtgtgg gttctgttct ctagattgga 
gaggccagtt gaaaaatgag ggagaataaa 
45 agaagaggaa tgaaaaagta agttggttgc 
aaaaaaaaag ctaatgctcg ctgggtgcgg 
gaggccgagg caggagaatc acctgaggtc 

50 

tgaaacccca tctccactaa aaatacaaaa 
atcccagcta ctcggggggc tgaggcagga 
55 cagtgagccg agatcgctcc attgcactcc 
taaaaaaaaa aaaaaaaaaa gctaatgcta 
ccattccatg aacatgggcc caaaaggacg 

60 

atgtggctac cacagggaca acatagaaca 
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cgggagggag gtgggggggt cagtcccccg 85920 

tggggggtca gacccccgcc cggacagccg 85980 

ccggccgctc ctactgggaa gtgaggagcc 86040 

tgtacccaac agctcattga gaacgggcca 86100 

aagggggaaa ggtggggaaa agatagagaa 86160 

aagtagacat gggagatttc attttgttct 86220 

gctgttgatc tatgacctta cccccaaccc 86280 

tcagggttca atggattaag ggcggtgcaa 86340 

cagcatgctc gttaagagtc atcaccactc 864 00 

ctgcggaagg ccgcagggtc ctctgcctag 86460 

tctgctgacc ttcccttcac tattgtccta 86520 

cacccaagaa tgatcaataa aaaaaaaaaa 86580 

aatttaaaaa taaaatccca gtaaaaaaaa 86640 

cagtatgata ttgcaatgat tgagcattag 86700 

tgggagggtc acaataggct tcccagtaaa 86760 

ttccaaactt tacagttgca tttacatctc 86820 

actctggaac tacataaaga tacaagacaa 86880 

ctataaaatc caggtaaata tctgatactg 86940 

catagaggtg ccaaaggaac aaagtaagtc 87000 

aaatcaatgg gcacaaactg tggctggtct 87060 

gggatgaaga caagttcttg actctatgtt 87120 

accatgaacg aaacaagaaa gaaacaaaac 87180 

tgtacaagag ataaaacgga ctgcaaaaag 87240 

tggctcacac ctgtaaaccc agctctttgg 87300 

aggagttcga gaccagcctg cctaacatgg 87360 

attagctggg cgtagtggca ggtgcctgta 87420 

gaatcacgtg aactcgggag gcggaggttg 874 80 

agcctgggtg acaagagtga aactccatct 87540 

agaaaggtga ataataggtt caactgtatg 87600 

aggccatgcc ccactgggca cttgatgatt 87660 

cagaagtaat actggagccc ttggctttgt 87720 
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aacacatttt aattaaggta cttctttctg 
gggaaagaac attcattttc accagatcag 

5 

ctgtggatgt actgggtgat tttgtgaaaa 
taaagtactc ttgaaaataa taatcatctt 
10 atgaggtaga aagaaactat aaagaagtaa 
atcacagagc aaattgctga ggttttctct 
tcagagctcc aatattttta tgtgttcctt 

15 

tcttattttt aaaaaaagga caactagggg 
tcctactaag tcaataggtg gaatctataa 
20 tgtatcattt acagaattgt cttcatttca 
gctctgtctt taattccttt tataaaacca 
tcctaataac tatagaccta gccatgaata 

25 

tagtccagaa actaaacagc agatatgaaa 
attgagtttt cctcataaaa tttctcgaga 
30 agaatgtaat gaaatcatat acatgcagaa 
taaaagctca aaaaaattct tgttctggtt 
tttctttgct cttgaaattg accctgttca 

35 

atccactgcc taaaacaaga tagaatacct 
tttaagctct tgccatgtgc taggcactgt 
40 tgaactttag actagcctcc ttaggtacct 
taacagggat ttcattaaga aacttgctta 
aagattagaa aagagggctt gagcaatatt 

45 

taacctcaca ttctcaaatg ctattgagag 
atggacttgt ttcctttttc cttgtttaaa 
50 attttttttt ttttggagat ggagtctcgc 
gatctcggct cactgcaagc atccacctcc 
tccaagtaac tgggaataca gggggacacc 

55 

tgtattttta gtagagacgg ggtttcacca 
ctcgggatct gcccgcctcg gcctcctaaa 
60 aaattcctct gtaatcagat atttcagaaa 
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acaagaaaac 


aattaaaatt aaaataggta 


87780 


cattttgtga 


aaggtgatat aagtgtatac 


87840 


tgattcggtt 


tcatgttttg ctttatgcta 


87900 


cactctgtca 


aaagtgaagt tagcatgaaa 


87960 


atttttccca 


tttttatttg aaataatgta 


88020 


aattgcttta gtcttatact tttgatctca 


88080 


gttcttaaaa 


ataattgcag ttgttctaat 


88140 


taaatatttt 


gtcaactgtc aagagaacaa 


88200 


aaggagtcta 


tttattagtt tcccttctca 


88260 


attgaataat 


taaagaatca gaactggctg 


88320 


gaacttgacc 


cagtaaaaat gtattctgag 


88380 


atctgggttc 


tatgtgaaat ttacagatct 


88440 


ggagaaactg ttgataaata aatgaaaagt 


88500 


caaaaaaaaa 


gcaatgatca acagcaaagg 


88560 


acgtggacaa 


ttctttaatt gtatgctttt 


88620 


aattactgta 


ctctgagtac ccatctccta 


86680 


attcctttta 


taaacttttc ctatggccac 


88740 


tgggatctaa 


taataaaata atagttcctt 


88800 


tccaacagtt 


ttgcataaat taggataatt 


88860 


tctgtcattt 


tacagttgag acataggaac 


88920 


aagtcccaga 


cttagtgaag tagcagattg 


88980 


tggccaccga 


aaccagatcc ataggatgta 


89040 


ttacgtggtg 


ttactccaag agtcacagct 


89X00 


ataaactttt gtgggctcaa atgtgtttac 


89160 


tctgtcaccc aggtgggagt gcagtggcac 


89220 


cggggtcaca 


ccattctcct gcctcaacct 


89280 


cctacacccg gggaaatttt tttttttttt 


89340 


tgttagccag 


gatggtctcg atctcctgac 


89400 


gtgctggatg 


tgtttccatt ttgaaaaaaa 


89460 


gatgttttct 


cccttttcat tgagttttca 


89520 
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accattttca gttgcatcta gtatcggagg 
aattctctca ttgtgaaatg gggattttag 
5 gggtacagag ataatgtatc tcatgcactt 
taatcggcac aattccacat actttgcaga 
ctaaggaagg tagtcttatt agctctattt 

10 

taacaaactt gccaaagaca caaagttgga 
gacactatac agaccctact gtgcctataa 
15 cacctcacct aaagcttgct aaagagaatc 
aatccggttt ataaactcct ctccatggca 
ttgcaggagc agagtcaata ccctgtcacc 

20 

tccagagtga tgctgcatgg agggggattc 
gtcctaactc tgccacatgt tttctgtgta 
25 tcaatttctc atctgtaaat caaggggaat 
accctaaaat ttctggattc tatgagcaca 
agtgatacat tgtaccaata ggttattcat 

30 

attcatcact ctaactttct tgatattcat 
acctcaaaaa aaggaatcta aggctaaaga 
35 agaattaata gtcattgttg tccatattca 
ccttgcctac atggggcctg agatgctcta 
catttttcct aaagaacaga agaatcatct 

40 

ccaattactg gcaaacaatt ttaatcttgg 
tacttggccc ctgtattagt tttctatttg 
45 gcttaaaaca atataatttt tttatctgac 
tcactagact aagttcaagg tgtcagcaga 
gaatctgttt ccttgtttca tccagcttct 

50 

cgccttcctc caacttcaca gttggcagca 
gatgaaacta ttccacctca ggtcatcagg 
55 agatccccaa catgtgcagt tcacagtagg 
gctggtctga taggaggcag agctcaggca 
gtgcagccag gtttccaaca gaccacagac 

60 

acttatgaga atcatgtttc tatcagttgt 
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gtaagggtta tgagtacagg ttctggatta 89580 

tagagtacct atctcatggg gttctgtaaa 89640 

aacatagaaa taataataga gccaaccctg 89700 

gaataactct ttaatactca aactgactct 89760 

tacagacaga aaacaaaggc aaagagaggt 89820 

gttagatccc aagttcatgt tcctaaacat 89880 

taagtcttta ataaatgtta gctctctata 89940 

cttactaact cactcaatca caaagaagtt 90000 

tcagaagaca gaagttttgt tgttgggtag 90060 

aaggctggcc tggaaaacaa gctgcaaatg 90120 

tgaaccaaca gccaagaatc ccattttcaa 90180 

accttgagca tgtcacttaa cgactcagct 90240 

gagacaggat aaatggaaag ggtcttttct 90300 

gggtctgcat atgtgtcctt gttagatacc 90360 

agtattcata cttcatacca agaatgagac 90420 

aagcacaaga aaattgttgt gcttttgaag 90480 

caaatttttt aagtgtcata atttttttca 90540 

gttgtactgg cttatttcca acaaaatttc 90600 

agctttcaga aggacagaaa tgaataaaag 90660 

taagtgggca tatgtgaact atctagtttc 90720 

actgccctcc ttgccataca agtattgtat 90780 

ttgatataac aaactaccac aaatgtagtg 90840 

agttctgtag gtcagaattc caataccagc 90900 

gtcatgttcc ttcctggaga ctctagggaa 90960 

agaggcgacc catacacctt ggttcatgga 91020 

tagcatctct ctaatgaggg atggtttcag 91080 

aattagagtc tcataaggag catgcaacct 9114 0 

gttcatgctg ctatgagaat ccaatgccat 91200 

gtaatgcttg cctgctgctt acctcatgta 91260 

aggtaccagt ccttggcgtg ggggttgggg 91320 

tatggcccat tatctgctca cttgttctct 91380 
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ccatttgcat agataattga catttgacat 
gataaacgta tcacttttgt atgttttgtg 

5 

caacaagcca tgctgaagta aatgaaacca 
tggctcctgg atttcaacag cctctgggtt 
10 ctcagcgtgc tcagccctac ggcatcacat 
gtcaaggaaa tatacaaatg ataaatccaa 
aagaagcaaa ggcactaggg gtaagtctat 

15 

aggtcttctt ataagttata ggagagtatc 
tctgaaatta tctcttttag acagaaatat 
20 catttcattt tagatcaagc aaaaatataa 
aatttggtaa tattgacgtg acagaagcac 
aaggactttt tggacaatct caatcccatc 

25 

taaaggtgaa tttacataac aatgaacaat 
atgggcagtc tgggaaaggc tgagtgtgaa 
30 tctttcacca atacccttag agatgctaca 
gactctgaac tgactgcctt gtttctgtag 
aagaaaattc cacctgccta ggtagaattg 

35 

tcattcatcc cgctagcaca attccaccaa 
caaaatgctc tg.ctggagca ttccgtaaat 
40 agccatctta ctatgtattt caagttgtga 
tctcctgcag cagcctattt actaagctgt 
agatagagga aattccctgc tcctaataag 

45 

cagccccaca caggtgcttg gaatatcatt 
acagaactta cgccctagag ggggaaatca 
50 ttagtgtgtt agaaggtgtc gaatgatata 
aagtactagg cagtgaggag gcagcaattt 
catgacattt gagcaaaaac ctgaagttag 

55 

aaaacaacca gtgtaaaagt ctttagatga 
aggaggccaa taaaatagaa cagaatgagt 
60 tgagtggcag aggccacagg tggtgaggcc 



ttggtaagat 


tttctttagt 


ttcttttctg 


91440 


atacttagga 


cataatgatg 


tcatccaagc 


91500 


tacccaaccc 


ttacccacca 


ggcagcttta 


91560 


caatcaactt 


agaaaaccaa 


gctcagggtg 


91620 


ctccgggaat 


ctttgctagc 


agtcaaccgg 


91680 


gtgtgggaac 


agcagtaatg 


aactttaaag 


91740 


ttactaccag 


aattttaatt 


tcacattgca 


91800 


atccaattgc 


taaaaagttc 


tgaacgttat 


91860 


ttagtctgcc 


tatagacttt 


cccttgagtt 


91920 


caagacacag 


aggagcctcc 


ctagctggag 


91980 


tgttacatct 


agcttgatct 


catttcttca 


92040 


tctggattag 


ttacaagtct 


ggtttttgca 


92100 


gatggctgcc 


cccagaaaag 


gaaactacaa 


92160 


agaattgggg 


gacattccat 


ctgctatgcc 


92220 


gtttaactgt 


caccttactt 


ctacaaagga 


92280 


gcattgtaaa 


aacaaaaatg 


ccctacttgt 


92340 


ttagtccctt 


ctccgtgctt 


tccaaataca 


92400 


attatattaa 


gtctctccct 


gtgaatagct 


92460 


gaatgctcat 


tatgtgccag 


aaactcttct 


92520 


tgttttaggg 


aagtaaaaat 


gagcataagt 


92580 




cttctgatcc 


tcctcctccc 


92640 


ccaatgggac 


aaactaacac 


cccatacagc 


92700 


aagcacaaag 


gaggtgcaat 


ctctgccctc 


92760 


gactgtatac 


ataataagga 


agtaaagtac 


92820 


gacaaaacaa 


agaagtgtaa 


aagggctaaa 


92880 


caaggtggtc 


agaatgggct 


ttccttagaa 


92940 


gagtgagccc 


tgcagatatt 


agggaaagag 


93000 


aaatgtcttt 


gaggtatttg 


aagaaccggg 


93060 


gagggagggg 


gcagttgtca 


ggagggcaga 


93120 


tgcagagcag 


aaggctttgg 


ctttgactgg 


93180 
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gtggatgagg aaccactgaa gcatgctgcc 
tttaaaagaa tgtgaagttt tgttttcaaa 
5 tctgaagatt tgttatttct atgccatcag 
ccacaggtgg caggacttgg cacacctgta 
tggccaaaaa tgcagacaaa ttgtatgtcc 

10 

tgatcagtag atggtacagc cagggctgta 
gatctaactg cccctcatca aggtgacagc 
15 cggctgaatt gctgctctaa agaacagttg 
gttctttaag tgagataggc taaacgtggc 
tatggtatgt gtgtgtatga atgtgcaaga 

20 

tttattgatg tctcaagaaa cttttagcat 
taattcgatt tacttgtgtt cagtagctat 
25 tatttgcagt ctccctggat agataagatg 
acaatcaagt tttggaccaa tctagaaatg 
aatgaaggaa ggttcaataa gaggtacaag 

30 

aatatatgat tgataagaag aatgattggt 
aaaaaaggac aatatacact ttgggaggcc 
35 cagaccttcc tggctaaccc gtgaaacccc 
ccaggcatgg tggcgggagc ctgtagtcct 
gtgtgaaccc gggaggcgga gcttgcagtg 

40 

gggcaataga gcaagactct gtctcaaaaa 
tataaatgaa tccaaaatca agtaataagg 
45 ttcagaaaga aagaaaaaag attaatttgg 
gagcagagct gagcatgggg gaagacagta 
tgtgctgcag taaattccca gatctgcctc 

50 

atctttatgt ctctgggact caatccagtt 
aagtgttcag aacatgacca agcacataat 
55 aattccttta cccgtaactg ggatgaccca 
ctactgacat cacagttaaa gtattgtttt 
gctaatgcat gattaagcag ctgtactttc 

60 

attctccaaa tacctcaaaa cagggcgttt 
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cggcaccgaa gaggcgtgtg 


ataaaaatac 


93240 


atctccagtt ggcagaagct 


acttcctctc 


93300 


caggccctgg gcatcatgac 


agtgcagggg 


93360 


ccagtcaact cagaccttaa agtgcttctg 


93420 


aaatttttaa 


gtctgtccta 


taggcaattt 


93480 


ggaggagaaa 


tgagggccag 


tacaaatcca 


93540 


aaggatgaga 


atgggaagtt 


tgtgaccaga 


93600 


taaaaaccaa 


atataaagtc 


taatacagga 


93660 


ccatcatctc 


tctgaagcta taagcaaaag 


93720 


ataatttttc agggagaaag gttcatggtt 


93780 


tgtttatgaa 


actgggtttt 


tttctaattt 


93840 


ttattgaaga 


aataaataca 


gcagtctttg 


93900 


taaccatggg gaaaaataaa 


ttacaaatct 


93960 


attattcctc 


agaaatctat 


ctttatttac 


94020 


gaggtaaaag 


atgtatacct 


tacttttagt 


94080 


aagaaaatga 


ttgcatacct 


gcaacttttt 


94140 


aaggcgggca 


gatcaagagg 


tcagaagatc . 


94200 


gtctctacta 


aaaatacaaa 


aagaaattag 


94260 


agctactcgg 


gagggtgagg 


caggagaatg 


94320 


agccgagatc 


atgccactgc 


actccagcct 


94380 


aaaaccaaaa 


aaaaaccaaa 


aaaaaaacaa 


94440 


tgtgttttat 


gtcatgagat 


gcaactgaaa 


94500 


tgtaatcaag aagacttcat ggaaaaggta 


94560 


ccgaataatt 


cagacaggct 


cagcagtcaa 


94620 


ttaggaattg 


tgtgaccttg gatacctgat 


94680 


gtaggaggat 


tacatgaaac 


aataaatgta 


94740 


tcatgcctaa 


taaatggttt 


tatcttcacc 


94800 


agcagagaac 


tgatccatct 


gaaaaagcag 


94860 


tctaatagtc 


ttttagaggc 


cctcagagat 


94920 


tgaagaattc 


ttcttgggcc 


accccagtaa 


94980 


cactaagggc 


cttgcactcc 


agttattcta 


95040 



-269- 



WO 02/062946 

gtatatgtca gaatcttcat gacccacctc 
atggatctca gcaaaattga caataaataa 

5 

acgtctgaaa gtgaagaagt aattttaaat 
catggttgga ttgatgcaca ttggttttgg 
10 tagagaagta ttaggttttg cctctactgc 
cctttctgtg agtagattgc tagaacacca 
tggtttataa agtattgcta tctccagcca 

15 

catctgaggg ctggagaatt tagaaaaatt 
gaaaaattaa cagctctgaa tgagcatgaa 
20 aggagcttat tccattcttt aaaatttaat 
cctactacat ttgagatgga aatgtatgta 
gattggaaac atgattaaaa atatgcaagt 

25 

cccagctact tgggaggctg aggccaggga 
ctgggcaata tagtgagact caatctctaa 
30 attatgagaa gggatggagc agtaagcctc 
cttcataggc cagaaattaa ggaaaagaaa 
gagttactcc agaaaattca gaagtgcgag 

35 

ataaatagac caacctttca tataaagaga 
cacaaataaa tttgtccatt tccacagttt 
40 tccaaggagc tttcccgttg tctggtaagt 
aagattagct taacatattg gggaggaaaa 
ggagcacctt tcccaaaaaa aatctattgg 

45 

aaatggtcca cacagtttgc cactaagaaa 
tcacaaaggt gagaagtagt aaaaatcagg 
50 ggttcagtct ggtcacactc tattatggga 
tacatactag acacatcagt taccaagtct 
tcttgtattt ttattttttt atttgtgcta 

55 

acaattcttt aaattgccat taaatgagtc 
atgatgagta agacaggact ttgagctctg 
60 cgtggtctac tattgaacca ggaaaaagaa 
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ctgcttcaga tgtgtgcctg 


aatgacaacg 


95100 


gcaggggcag caatgacagt 


aatgtttcac 


95160 


ggctgttttc ttttttcagg 


tgatccagat 


95220 


aattgttttg tgtttaatat 


ccttctcttt 


95280 


tgttattggt ggatacccat 


tctggggtgg 


95340 


gtccttcttg gtttataaag 


tattccctct 


95400 


attcacaact catttcttga 


taaagcccta 


95460 


aacagctctg aatgagggct 


ggagaattta 


95520 


aaaatcggga actctttcgc 


cattttacct 


95580 


catgtaatca gcaagccaaa gttttgatat 


95640 


ttgggtggag agagaaagaa 


aagaaaaggt 


95700 


aagccaggca tagtggtgtg 


tgattgtagt 


95760 


attgcttgag ttcaagagat 


ggaggccagc 


95820 


aaaaattttt taattaaaaa 


aatgcaattg 


95880 


agaacaagga atgcttaaac 


attgtgagtt 


95940 


ataaaaaaga agtcaaggac 


atatgactga 


96000 


agaaaatttt gagaaggaaa 


ctagatgaac 


96060 


aattcccaag cacaaccagt 


taacccattt 


96120 


attatctctg gctctctctc 


tgtgtcagca 


96180 


tagactgtct ctactttttg 


aaccccttta 


96240 


ttgaaaagct ggatttggga 


aatgcaaaag 


96300 


ttgccttatg gtctgagcgc 


tggggtgaga 


96360 


tggcagagct ccatatcctg 


actcttgaat 


96420 


gttgtttaag ctttaaccta 


aaagctccct 


96480 


agtccaaagg agaatccagg 


agccaggaaa 


96540 


ctactgttca tttgccactt 


ctcagctctg 


96600 


tttcctactc ttacacaggt 


tgtgtgtttt 


96660 


tagcttgttc tgtaagaaga 


gtttataaaa 


96720 


tcggtcacag caccccacct 


attccaattt 


96780 


agtgatttat caagaactag agtcaactag 


96840 
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agcagaggtg gtgtcaagat tatatgtaat 
gaacacagaa tcatgtagct catgctcttg 
5 aattaaaata aaataagcct aattctgatt 
taaagcaccc tttctctgca tcttttaagg 
atagattcca cctcttagag gagacagcca 

10 

agcttttaga gaacttcagg aggctctaag 
actcgactgt gccaccccta gcaagtcacc 
15 catgttaaaa tcctattgta gaaactgggg 
ccacaaaagc ctaaattttc tccttctaaa 
acttagggtt ttcattaatt cctaaatttt 

20 

ccatctataa aatcatatgc tatcgtacta 
atttaaataa tttaacacag tgtctgacac 
25 cattttttta taaagaaaat atagttgtta 
agatgagtgc ctgctttata tgatttacat 
tctaggctgc atacagaata atgtttatgg 

30 

atttcaactt gaaaaattgt tttgaaagtt 
ttcttatctc acttaacttc agtgatcctg 
35 ttaggtgaaa ggcagcctgg gaatgaacat 
gattctgctg ctggtggata tgtgcatcaa 
ggtaagtatc ccatactcca ccatgtgcct 

40 

atgttttatt tcttaatacc gtataccagc 
cccaccccta gtcatatgat ctctaggact 
45 gatagaccga ggcttaaagg acatccttta 
aatggggaag ggtttggata ggcctccatg 
caacatttga aattctatat aaataaggta 

50 

tcagagtaaa gtatgaagtg agagggaaat 
agaaacccag gaatgaagca attgctaaat 
55 acaatcacac attcattagc cttatatagc 
gggtggccca agggcaggaa tgcttaccat 
tgaggttggg ctggaagctg tcatcctacc 

60 

attctgtgag caatggaaaa ccacagaaga 
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ccagtacaag aaagaagaac 


ggggaaagag 


96900 


cagacaaaga agaataaaat 


aattaaatca 


96960 


aaccactcct gttgcagtcc 


aaaaacaaaa 


97020 


atccttccgt ctaattgaga ttgtcgaagc 


97080 


ctgacacata cctaagacaa 


tttttctaag 


97140 


gcacgctgca gcacaagcac 


attctaaaga 


97200 


tgcggacttg gtttcacaat 


tattcttctg 


97260 


aaagttttgt agtcgcccat 


acacctctgc 


97320 


tttgggggtt catatttagt 


ccccagagag 


97380 


tcagcaaaca tgttgtgtct 


aaatactttc 


97440 


cctcattcaa aggctgtaag agttgagata 


97500 


agaaaaatgc tcattagaag 


tttgtttttt 


97560 


ttgtgcatga agcttttagt 


aaaatccaat 


97620 


ttttctacag ctttttagaa gtaccctaca 


97680 


taattaccca gaagggcaga aaacagaata 


97740 


caaaaattaa gaaccatgaa atttgattga 


97800 


tatataattg attttttcct gccatctcca 


97860 


tgttagttct atcttggcct 


tcattggagt- 


97920 


tggggtagct ggccaagact 


actgggccgt 


97980 


gctctatttt cagtcccgta aagtgggtct 


98040 


tctagtggaa aatgtttttt 


ctattctagc 


98100 


atagaataaa gtgagagggt 


tacctctaca 


98160 


ttggcccatc tcactgttaa 


tcacatgaac 


98220 


gcttctaaaa ttgtccaaaa 


attattttcc 


98280 


gtcaagctat tgagtgtttt 


ttgttaaact 


98340 


gacagggata agaacatctg 


atttttacag 


98400 


gaagcacgta tctctgaatg 


tgctcaaaac 


98460 


caagcatgga ctctgacctt 


aagaggaaat 


98520 


ctataaggaa tggagcaaca 


tggtataaaa 


98580 


tcaatggcca cattaaccat 


tttggtcttc 


98640 


gtggttttat tggtttcatt 


ttattttatg 


98700 
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ttggggttgg ttttgctttt gcttttcagt 
tgggctacat atatcttaag aaagacacag 

5 

aacattaaga caatatcaca atattttgaa 
gttcactcaa tactgtgtaa gtaattctat 
10 aagccatctg cttctcaact aaagggttct 
tgaaagttgt ctaggcactc ccatcagatg 
tcagtagttt gaattgctgg cacactcttt 

15 

aaaaacatag gagagtttta ggccgggaat 
caacactgtt tgcagtgtaa ggaaaagctt 
20 actgctgata aagaaatacc caagactgag 
ccacatggct ggggaggcct cacaatcatg 
tatatggatg ggggcaggca aaaaaaatag 

25 

aaccatcaga tctcatgaga cccattcact 
ccataattca atcatctccc accaggttcc 
30 caagatgaga tttgggtggg gacacagagc 
gaaagcttct cggtaaataa ggcagagtga 
caaacaaaag aacaaaaact attcaaaaaa 

35 

gaaacaaatc attctgatgc tactcaaagc 
cttgtttccc tacaaacaaa actgaggaaa 
40 tctcactaac actctccagt ctgtagctga 
tgaaaggata aagcaaatcc tgtatggtac 
agacagaaga aagggcaaca ccttcccata 

45 

atgagcagaa atgcacagga atgcacagga 
tactaagcat ttagcaggct ctcagaagac 
50 gaactgtgct cgaagccagc taagctatct 
ttcagaaaca tggatataaa acaaccgcta 
caaatgggag ccaaataaac agaagtgtca 

55 

atcaaataga aagcctgtat agagttcaag 
taacatgatt ccattcttat agaatgatgc 
60 tctcacggca tctttccaca catccattct 



ttacaaaaag cagagtttat taaattactg 98760 
aatgacttcc ttatctatct agaacatatt 98820 
caaataacca caataacatt tcctttatca 98880 
ttcactggat ctcggagtgg caggcttatg 98940 
gaaatcctga ctccaaccat tggtataatc 99000 
cgtgtcccct agagtatgtt ctttgaagtg 99060 
ttccaccaag ctctgagaca atcctttgtt 99120 
ttgaatgatc acttttgtgc ttggaaagga 99180 
atataagaag gtgtattagt ttgttttcac 99240 
aagaaaaagg ggtttaatga gcttacagtt 993 00 
gcagaaggca aggaggagca agtcacatct 99360 
agcttgtgca gggcaactcc tgttttttaa 99420 
gtcacaagaa cagcatggga aagacccacc 99480 
tcccacaaca agtgggaatt atgggagcta 9954 0 
caaaccgtat caggaggcaa aaatagatga 99600 
agcaagaaca aaattcaccc tccaaaatac 99660 
aaacagcttt caaactggca tccaagtaga 99720 
tgaaaaaaag cacagctttt agacctggct 99780 
gaagggaagc caaaaaaaaa tactgagaag 99840 
gaaggctggg gtggagagca gtcaattctg 99900 
aagaccatga gtagtaccca ggacctcagg 99960 
ccctgcatgg taccctacta ccagcaggga 100020 
ataaacctcc agccaaggag caggcctcca 100080 
agaaaaagtg aagtccatcc tcgctagcaa 100140 
ctctccctgc aacagctata tatgttcaca 100200 
gggctcaagt tgtaatggaa gcaaggaaga 100260 
atgaaggaaa attcacctac actgtacctg 100320 
tgagcttaat acttttattt aaaatagaaa 1003 80 
tcctctctct tggttattct cttttttccc 100440 
tctgcctttt ccctgtacac agagagctag 100500 
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gataatagtc atcaaacatt aattttggtt 
gttggtttct gtattgcttg acttttttaa 
5 acgaggccat cacaaaggaa aaaaactacc 
tacttaccaa aaggaaagac ttctggcatt 
ctctggaaca agcatttgat gatactggaa 

10 

aatagcacat ttgatgagtc aataactttt 
caaaatttca ctatttattt atgatttcac 
15 tgatcttctc cctcttggag ttcttcgtag 
caaacaccac aaccaatatg gtgagttggg 
cctggagaaa tacagacttg tgtctgctaa 

20 

tgtccttcac tctgatggta ccaggggaaa 
gtttttattc attctttcca gtctgtcctg 
25 gtgacaccag cgtcttcttc agctcctccc 
aaatagtaaa agaaaaaggg gtatcagtct 
cttcttaaga agatgtcttt tattgtctac 

30 

gagatttgtt tttaggttgg tcgctaatga 
acattaccac tactacatgc tggcaaaggt 
35 caactctctt aaagttagaa atgtttctgt 
ttagaaacaa aagccttttt cagtatgttt 
tagagcagta gatctgatct atactccaag 

40 

ctaggtgtcc agtcactgga gttccctaac 
gcaagtgtcc aattgccctt gttggaggtc 
45 atcttgcctc ttaactatca tctacaaaat 
ttaaactcac ctgcaatggg acaacagtca 
ttgagttttg gctttttgtt tttgaattgt 

50 

aaggtaagac agaagggatg aatgacagtg 
aaaacatcac ctaagtcatg aaaaagggca 
55 tcacagatat ccacagcttt gcaatagtgg 
atccctgtca aagcagtagt gatgtcatgc 
gttgaacaga atggtaacaa gggaaaagtc 

60 

acaggattgc caagcggtac agaaggcaca 



PCT/US01/48437 

atttctaaac actggcattt ggaatgatcc 100560 
tgagtagtaa catttgtatt aaaaaaaaca 10062 0 
ttctttctaa tatttctaaa caaaatagcc 100680 
ttaaagaaaa tccaaaataa attcacactc 100740 
gaataagcat ggagcgagtt gagtataagc 100800 
tgaagcccac gtgaggccct cctgattagc 100860 
tttctggaaa aggcatttca gccacgctga 100920 
cttgtgccac agcccatttt gccaaccaag 100980 
tctcttcttt gtaaaataat ctgaaaatgc 101040 
atcctactat cggcttacca gaatgctctt 101100 
ttgcatagga ttgtagtaac catattgctt 101160 
gttattccaa atatgtatga aagcaaccct 101220 
agatgcaaca actactcagc taatgcccct 101280 
aatctcatgg agaaaaacta cttgcaaaaa 10134 0 
aatgatttct agtctttaaa aactgtgttt 101400 
tggctgtatc tcccttcact gtctcttcct 101460 
gaaggatcag aggactgaaa aatgattctg 101520 
tcatattact ttttccttaa taaaatgtca 101580 
aaaaaattaa ttttgatttg atattggttg 101640 
aagattgata aacctttgat gatgattttt 101700 
tcatctcctc ctcatctttc tgttcagaaa 101760 
atattattct atacattaat atcatcaaac 101820 
tttgctattt acagagtatt ttcatgtaca 101880 
gacagggatg ctgcattgag gggttttcat 101940 
tggcacattt ataagccaat tagaacaatc 102000 
taagtttcaa gaaatcaaaa gcacagatgg 102060 
gtttctctac aaaatagaaa ggtataatta 102120 
cagtgaggat ccgatgtgat ggctcatgtc 102180 
ctaaggtaag ggcacaggta gagtcagaat 102240 
atggaactgg agaataaagt ggtcggagaa 102300 
tttgaggatg tggtaacaaa ttcagagtat 102360 
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ggacaatgct tctgcaaggc cttgctgcta 
tttgattttt tccagggtta agattttgct 

5 

gaaagcttag gacattgtaa aaaatattat 
ataaggaagg aataatagta ataataaaag 

10 taaatggaga aatctgcctt ttctccggga 
atatgaatat ggtttaaaaa tagcaaagta 
aagaagccag aagggaaaaa tataacaaaa 
gaggaagatt agaagaacag aatttttgca 
agagtcacat aatttcaacg tcattggtct 

20 gattggaaat gctgtgtcct tcaccataaa 
gatttcatgg tctaagggtc tggaaaataa 
gtatattgta tatctataca cgtaaaataa 

25 

cagaagcatt tccagcaagt tcattttt.ta 
tcaaattgct taataatagt aataataaca 
30 tcacgttcca agcactgttt caatgctcat 
ttcagatgaa aaaactttgg cagaaaagtt 
aaatggtgaa gcatggattt gaacccaggc 

35 

taaccattat tcccactatt cttatggtct 
acaaacagaa atgtttctct atagtaagct 
40 ctctgagaga ggagatttag taacataata 
gaactgcgtg gctccactaa tgtgcatatt 
agaataaact gatcagagaa acagaggatt 

45 

ggttttagca aatttatact atgaacaaat 
tagaaggtga ctggaataga aatatgatat 
50 ccctcatgcc cagccgaggc tggactgtac 
tgcctgattc tcctgcctca gcctgccaag 
tctgactggt ttttgtattt tttggtggag 

55 

ctccagctcc tgaccgggag tgatctgcca 
gacagagtct cgctcactca gtgctcaatg 
60 tcggctcgct acaacctcca cctcccagct 



gatccaggca actacggatc caggaaacta 102420 
agaggagatg acaaaatagc agaagagcaa 102480 
ttaaatgatg gactgtggaa tctaaactgg 102540 
caataataaa gaaaaaagac agattgatca 102600 
gctgtcgtga caatcaaatg agataatgtt 102660 
ttgtcccttt ttgaaatttt aatattatta 102720 
agagtaggca gagaaatcaa aatcagggaa 102780 
tctgggaaag cagatgtatt atgtcaggca 102 84 0 
gcctaggtcc caactattca gcaactgggt 102900 
gctcaacagc tcagcaaaag tgaagacaaa 102 960 
atggtgcctc aaaatgtggg ggaggggatg 103020 
acaaatataa aagtcactta ctttagggaa 103080 
agtaaatttc tatcaaatat attctatttt 103140 
gcatatattt tagcccattc cttgtaccac 103200 
gacaaccacc tgtgaggcag gtggatcatt 103260 
aaaaatcttg tcctaagata tgtagctagc 103320 
agtatgtttc ccaggtccgg gcagttgccc 103380 
tggatagaat cagacacttg taatgccaaa 10344 0 
tgtaataaat ttggaagaca ttgccttatt 103500 
atacagttga cacttaaaca acatgagttt 103560 
ttttcaatgt gggaaaagtc atggaaacgg 103620 
accaagctat actgagggta catttgaggg 103680 
ttctccagct gggcctgtga tctgagaccc 103740 
tgccctctcc ctctccctcc ccctccccct 103800 
tgccaccatc tcgactcact gccacctccc 103860 
tgcctgggat tgcaggcatg tgccgccacg 103920 
accgggtttc gccgtgttgg ccggtctggt 103980 
gcctcggcct cccgaggtgc caggattgta 10404 0 
ctgcccaggc tggtgtgcag tggcgtgatc 104100 
gcctgccttg gcctcccaaa gtgctgagat 104160 
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ggcagcctct gcccggctgc caccccgtct aggaagtgag gagtgtctct gcctggccgc 104220 
ccatcatctg ggatgtgagg agcccctctg cccggctgcc cagtctggga agtgaggagc 104280 
acctcttccc ggccaccatc ccgtctagga agtgaggagc atctctgccc ggccgcccat 104340 
cgtctgggat gtggggagca cctctgcccc gccgccccgt ctgagatgtg aagagcgcct 104400 
ctgcccggct gcgaccccgt ctgggaactg aggagtgtct ctgccccacc gccaccccat 104460 
ctgggaggtg aggagcgtct ctgaccgccg ccccgtctga gaagtgagga gcccctcggc 104520 
ccagcagcca ccccgtctga gaagtgagga gcgtctccgc nnnnnnnnnn nnnnnnnnnn 104 580 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnaaaaa aaaaaaaaaa 10464 0 

104644 

<210> 80 

<211> 95109 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> genomic_DNA 

<222> (1).. (95109) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> initial^coding_region 

<222> (60039) . . (60257) 



<220> 

<221> coding_region 

<222> (62556) . . (62678) 

<220> 

<221> coding_region 

<222> (66166) . . (66225) 
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<220> 

5 <221> coding_region 

<222> (67828) . . (67959) 

10 

<220> 

<221> coding_region 
15 <222> (74200) .. (74313) 



<220> 

20 

<221> coding_region 
<222> (74489) . . (74590) 

25 





<400> 80 








nngtttcaaa 


ttcctgggct 


caagtgatcc 


30 


tacaggtgtg 


agccaccaca 


cctggcccct 




atgaattcat 


tagtcttcca 


cccgggcttt 


35 


attttagcag 


aattcatgtt 


tctctggtag 




ctaagtgcct 


caaactaaat 


cctgttaaga 




ttggtgcaaa 


aattttttga 


aatccgtgcg 


40 


actttttgaa 


gaccccttgt 


attaggttga 




tttaatctac 


aaaattgtca 


gtattgtatg 


45 


atatgcaatt 


tggaatatga 


tccttgtgaa 


aggttttaga 


atactatgaa 


ttaccaatat 




aggcaaagtg 


aaaatgaaat 


tcaatcatta 


50 


tttcagttca 


ttatggataa 


aatgattcct 




tatagttcag 


tataataagc 


agtatataac 


55 


atactttccc 


ccgttctata 


attaaaaatt 




cctgtaatcc 


caacactttg 


ggaggctgag 




accatcctgg 


ctaacacggt 


gaaaccccgt 


60 


cgtggtggca 


ggcgcccgta 


gtcccagcta 



acccacctcg 


gcctcccaaa 


gtgctgggat 


60 


cataaacttt 


ttgtaaagcg 


tcaatagttt 


120 


gacatcaatt 


ttatgcttgt 


tctttcttca 


180 


gggctctttt 


caaactgact 


tcttatcctt 


240 


catgttataa 


caattttgtg 


tgagtttatt 


300 


tagtttttcc 


atagtatgca 


tttcccacaa 


360 


accatttgaa 


attgctgata 


tcagaccatt 


420 


atccaatcca 


atgtattaga 


ctattccaat 


480 


atcaagagga aaataataac 


atagatatat 


540 


tgacacagtt 


tttgcactaa gtttcctgat 


600 


tgtgaataaa 


ttatctatta ggcagaaatg 


660 


gtaatagtac 


tttataacca 


tttattcgtg 


720 


agtatataac 


agttatatat 


atactgttat 


780 


agaatacatg gcggggcacg gtggctcacg 


840 


gcaagtggat 


cacgaggtca ggagattgag 


900 


ctccactaaa 


aatacaaaaa 


attagccggg 


960 


ctcgggaggt 


tgaggcagga gaatggcgtg 


1020 
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aacccaggag gcggagcttg cagtgagctg 
acagagtgag actccgtctc aaaaaaaaaa 
5 tttctgtttg gagatactga gcattctcag 
tatttcaccc aagaaaactt ccaggagtgg 
caagatggat gcaaggagga agcataggtt 

10 

agaaatcatt atttcattac taagagtaaa 
ctagtcttct ctagatcaat aatagttcta 

15 gagtaatttc tcctgaatga acacttgaaa 
tacttcaatt atccatattc ttataaagtt 
aacacgaaat gttttaacaa cagctcagcc 
aaattatagc attctttatg cctaatgtcc 
ttcacaatat gaattttatg aaatatttta 

25 tctaccctat ttttatttcc tttatctcta 
ggaaccattc ctttgtattt aatataaatg 
tactttatat gattaataca taaaaatgta 

30 

ttggggatag gtatacacct atgaaactat 
cacatcccga aggttcttcc cacacccttt 
35 tattaacatt taacacaccg ctatcccctt 
tcaccacagg cactatgcta tagagtagat 
aatgttgtga cttttaacca tcatcttccc 

40 

actattctac tctctgcttt taagagtttg 
tacagcagat gctggtgagg ttgaggagaa 
45 tgtaaattag ttcagccact gtgcaaagca 
gaactaccat ttaactcagc aatcccatta 
ttccaccaaa aagatatatg cattcacaag 

50 

gacgtgaaat caatccaagt gcccattaat 
acaccatgga atactacaca ggcataaaaa 
55 ggatgcagct ggagaccatt atcctaggca 
gcgtgttctc acttgtaagt gggagttaaa 
acaatagaca ctaagactat caaaaggggg 

60 

atagtgggta caatgctcac tacctgggtg 
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agatcgggcc 


actgcactcc 


agcctgggcg 


1080 


aaaaaataga 


atacgtaact 


ctggttacat 


1140 


tctggatctg gttgtcttga 


gttaaggaat 


1200 


tgacagattc 


tccaaagtag 


acataataag 


1260 


gatagcgtaa 


tattttaaat 


gctgactcaa 


1320 


attagcgtga 


agtataagca 


gccccatagc 


1380 


aacatactgt 


actctaacta 


cattatttct 


1440 


tttgcataac 


ctgccattag 


agagaaggca 


1500 


gagactgcat 


aaactaatgc 


ccactaggtt 


1560 


ttcagttttc 


tagaaggtcc 


agtttccaaa 


1620 


tagctacaaa 


taatgatact 


gcatagtttt 


1680 


ctgagattat 


ctcattagat 


ctgttaaatc 


1740 


cactctactc 


tcaaatatct 


cccttcatga 


1800 


gttttttaaa 


accacttttt 


aattttaagc 


1860 


tacatatgta atgcatagat 


ctgagtgagt 


1920 


caccaccatc 


aaggccataa 


acacatttat 


1980 


attattattg 


ctttttactt 


tttctttggg 


2040 


agcaatttta 


agcacacaat 


acagttttat 


2100 


ctccagagct 


tagttatttt 


gcataactaa 


2160 


atttatccct 


cccacaagcc 


actgacagct 


2220 


gcgattttag 


gctattatct 


aaaagaaaaa 


2280 


aagggaacac 


ttctacactg 


ttggtgagaa 


2340 


gtttggagat 


tgctcaaaga 


actttaaaca 


2400 


ctgagtatat 


atccaaaagg 


aaataaatca 


2460 


ttccatcaca 


gcactattca 


caatagcaaa 


2520 


ggtggattgc 


ctaaagaaaa 


tgtaatatat 


2580 


agggtgaaat 


catgtccttt 


gcagcaacat 


2640 


aattagcaca 


gaaacagaaa 


accaaatgcc 


2700 


cattgtgcac 


acatggacat 


aaagattgca 


2760 


aggaagaaag 


ggggaaaggg 


ttgaaaaagc 


2820 


atggaatcaa 


tcatacccca 


aacctcagca 


2860 
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tcacccaata tacccatgtt tcaaacctgc 
gaaattataa aaaataaaat tatgtaactt 

5 

taaataaaat ggaaaaaaat ttaagagttt 
gctatttgtc tttctgtgtc tggcttattt 
10 ctcacaaatg gaaggatttc tttcttttct 
cacacacaca atttattcac tcatccattg 
ttacgaataa tgctgcaatg aaaatggaag 

15 

ccaaaagtga gattgtttga tcatatgtca 
atattgtttt caataacagc tatagaattt 
20 ctattcacca catcctcacc agcacttgtt 
acaaatgtaa gatgctatct cactatggtt 
gttgagcaca tttgcatata cttcttggcc 

25 

ttgagaaagt catggaataa acttaagtgc 
tggtgtgtat atacatattg tggaatacta 
30 ttttgcagca acatgaatga aactgcggcc 
aaagtcaaaa accacacatt ctcatttata 
catacagagt agaataatac aaattggaaa 

35 

gaggaatgaa atactaccta ttgggtacag 
aagccgaggt ccagacttca ccacttacaa 
40 cccataaatc tattttttta aattgcccat 
tctgttgtat gtgtcctttg tatattttgg 
gcaaatattt tctctcattc cataggttgc 

45 

ctttttagtt tgatgtagtc ccacttgttt 
tcatatccaa aagacatcac tgccaagata 
50 ' ttctaggagt cttagggttt cagattttaa 
ttctgtgttt gatataaaat aaggctccaa 
tactaacaac atttattgaa gagactaccc 

55 

ctgaaaatta gctggccatc tatgcatggg 
tagtctatat atctgttttt atgtcagtac 
60 tatagtttga aatcaggaag tgtgatgcct 



atgtgtaccc ctgaatctaa aataaaattt 2940 

aaaaaaatat tgctgagatg gataaataca 3 000 

agattccaca tagtgagatc atacagtaca 3060 

cacttaatat cctccagatc cattcatgtt 3120 

aaggccagat tacacacaca cacacacaca 3180 

gtaacattta gcttgttcca tatctgacta 3240 

tgccaaattt caaatccttt ggctatatat 3300 

gttctatttt taacttttct agggacctcc 3360 

acattctcac caacagtgtg taagtgttcc 3420 

atcttttgat ttttaataat agccatccca 3480 

ttgatttaca tttccctgat tattagtgat 3540 

atttctatga cctcttttaa gaaatgccta 3600 

ccatcaataa atgattggat ttttaaaatg 3660 

ctttgctgtg gaaaagaatt aaatcatgtc 3720 

attatcctaa atgaaatgac tcagaaacag 3780 

agtgggagca aaaaagttgt tacacacgga 3840 

ctccaaaagg tgagagtgtg gaagggagtt 3900 

tgcacagtat ttgggtgatg agtacactaa 3 960 

tatgttcatg caacagaact gcacttgtac 4020 

ttttaaattt ggttgcttta ttttttttat 4080 

atattaaatc ccttatcaga tatatggttt 4140 

tttttcattg tttttcttac tgtgtggaag 4200 

atttttgctt tgttgcctat gctttacagg 4260 

aatgttaagg atctttctct ctgtgttttc 4320 

aataaattac tttattcatt tttacttaat 4380 

ttcatccttt tccatgttaa tatccagttt 4440 

tttccccatt atgtagtctt tctgcctttg 4500 

tttatttctg ttctctctgt tctgttccat 4560 

cactctgctt tctttactat agctttgtaa 4620 

ccagctttgt tcttgctcaa gattgctatg 4680 
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gctattcaga gtcttttgtg agtccataca 
aagtatactg cttttgatag aatagacatt 
5 catgggatat ctttccattt atttgtgtct 
ttttcagtgt acaactttgg catttccttg 
ttctctgtta aacatcattt tgtgtgtgtt 

10 

ctatgttctc ttgcagctca ctgatgttct 
gcagttcata gatatccaat tttaaaggtc 
15 tggtgtccta tttccttgat actttgtgtt 
ttgaagaagc agtcaatgtt tctaatcttt 
acagtgagcc cagccagaga ttcttggggc 

20 

ctatttctct tgttccttct tagtgggcaa 
aaagtcaggc ctgatgctga gagcctgcca 
25 tttaaaggtt gtgcatcttt tttttttttt 
cactctgttg cccaggctgg agtgcagtgg 
cccaggttca agagattctc ctgcctcagc 

30 

ccaccatgcc cagctaattt ttttgtattt 
caggatggtc tcggtctcct gacctcaggt 
35 gggattacag gcgtgagcca ccacgccctg 
gccagttggt aagatccaca gctgctgttg 
ctgagctgtg catgggcatg tgtggggtac 

40 

agaatgtgtt ggctatttgg gaagccctct 
gcctcttggc ggagttcaag agctggttat 
45 aatgtgtttg taaaacatct ttggttttca 
tgagcaggta tcatctccac tctacaaaag 
agtgctgtac ccaccacatc ttgcagagga 

50 

tgctgcttct aagcgggggt gtgtctgggg 
gccgacagat caagcacagc ctcttcctcc 
55 atgcccacgc agccaggaaa gagcggcatc 
ggagcctgca ggctccgtgg tgccacggga 
tctctagcac agccccgctc ttgggcgtta 

60 

ccgaggaagc agaccccacc gggctcccga 
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aattttaaga ttgttttttc tatttctgtg 4740 

ttaacaatat tagttcttcc aatctatgaa 4 800 

actttaattt ctttcatcaa cattttatag 4860 

gttaaatgta tgccagtgtt ttcacaattt 4920 

tattgttttc ctggttttgt taaattgtct 4980 

ttaagatgat tattttgaat tctttgtcat 5040 

aaccacaaga gcttcacttt gttcttttgg 5100 

cctcaaagct ttgcattact gtcttcacat 5160 

actggttttc ttcagtagag aaagaccttc 5220 

ctcatgtact ttttctataa tgtgcacact 5280 

gtctcaggat tgtgtgcctt ctttcaatcc 5340 

tttattttcc ctaaggcaat gtcctgaagt 5400 

tttttttttt ttttttttga gatagggtct 5460 

cacaatctcg gctcactgca acctctgcct 5520 

ctcccaagta gctagggcta caggggcatg 5580 

ttagtagaga tagggtttca ccatgttagc 5640 

gatccgccct cctgggcctc ccaaagtgct 5700 

cctcatcttc tttcaaagca gcagtgtcaa 5760 

agatccaagt gctgtctgtg ggagctcatt 5820 

tattcaaggt ggggttccca gggtgctagg 5880 

gacaagtcat tctgcagggt ttatgggcag 5 94 0 

cccagtggaa tatgtttgta attgcagtgg 6000 

aagttctgca gcatgagcat atgaggtacc 6060 

aataaactga ggaccaaagg atttaagtgc 6120 

ttctaagtgg caaggagctt ctgacaatga 6180 

ttagcagtgg ggccaccacg tgatcaatcc 6240 

tgggtgcgcc aggcccaagg acagaggatg 6300 

attgtcagcc cctaccccac cctcgtcttg 6360 

gccccaccgc acctcacgta tccgcgcgag 6420 

gaaccccgcg aggggggatg aaggacgcca 64 80 

ggctgccagg ccccccgctc agggcgctgc 6540 
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ctgcagaacc acgcccccct gtccctgtgc 
ggcctggtta ccattcgcga ctccctcgtc 

5 

ctagtaacca tggtatagtt gagacgatcc 
aggcgtctag gaggcttacc tggcattgcc 
10 aaacaccagc ggggctgttg gccttgtgag 
gttctcagca tctgcccaat ttgagctcct 
ttcctaatgg ttcaggtggc cataagatat 

15 

aagatctctg tttatcacct tgagatgttc 
tcacatagcc tagattttaa gtcgttgcat 
20 agcttgctct ttaggacagt agtaataaaa 
gttacatact agacactttt ctaaattatc 
ctcctatact aagacctcaa atagcatgaa 

25 

actgcaggaa cacatagaaa cttcgttgta 
ccaaaaaaaa aaaaaaatgt accatgcaca 
30 tatgacaaaa acatacaaaa aaatgactta 
tggagtatgg tttgagaaat tagcctggaa 
cttgtatgtt tgcagactat tctaaagtca 

35 

tatgaagaat tgctgtggct gagatgtggg 
atgaatatta gaggacagat gaccgaataa 
40 tgagctcctt gcgaactgag ccagtgatct 
cctcaatata gtaacagatt tggatttgga 
tggttcagct gggttggact ggctttgggg 

45 

gcagtgaggt attcaagttt aacaatactt 
cactgaggaa tagaagttta aattaggaat 
50 gctcatggag aaggttagag gagagttcca 
tcattcctaa gatataagac ttatgtcagt 
tgacatcagt gaaaatggca cagtggggaa 

55 

ttcagaagca acattctcac aactctgaaa 
gcttgatcaa gaaaacacaa ggaaatgttg 
60 cctagtctca tttcccactc cccagctcgt 



taagagggtg acgcgagtgc ccgaagccct 6600 

ctgggttctc ccattgccag gaatgtttta 6660 

actcttgtcc tacgcatttc ggcacaatac 6720 

tgagagaatc ttccatccat ttaagtaagt 6780 

ttgtggcacc gtgaaaaacc aaggcctgca 6840 

gtttgaaacc actgttgggt ttcccttctc 6900 

cctcttaagc ttgagtgtca gttattttat 6960 

tgcagggtcc ctgaagtgaa tttcgtgtat 7020 

tccagactta ccattcaaat ttggagtttc 7080 

ctacaatcat gactacattt attgcacaat 7140 

tattaatact aattttatga atgccatgag 7200 

atgaaatcca gtcagtttag ctgtttgcaa 7260 

cttatacatt tatttgacag tcactgagct 7320 

gtgcttagaa tctacagtta caaagatgaa 7380 

ttggtggaac tgaaactgac cagaaatggc 744 0 

aggaagctgg agggcacatc gtgaaagcct 7500 

gtagggagcc tctaaaatgg tatgatgata 7560 

gaataggctg aaggagcaat gaaatgaata 7620 

gaaatcttca taatagtcca agcaaaaaaa 7680 

ttgtctacca tttctatctc tacacccacc 7740 

gttttccatg ttttaatggg ttggttgatt 7800 

gaagactaaa ggggaggaag ttggaaggga 7860 

cagttggatt tctgggcacc attttaattc 7920 

gggacaaaag ccaagtgcag gatatggaaa 7980 

atgcagtgag catgtgggaa cccaaaagac 804 0 

gaaagtaatc taaaacgaca tggttagagg 8100 

caccaagcaa ttaataagct ggaaaaaact 8160 

attaacgaaa acttgcagca accaagggat 8220 

gaaacagatt tttgtgacat tttaatttat 8280 

cagtgccctt gaataaaaca gccttcattc 8340 
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tctgtatagg tacccagtaa cagagggagc 
ctgcttgtta gatctacctg gtggttccat 
5 ctcgcctaaa acagaactat cccggagctc 
taaaggaaaa aacataatag tcattgattt 
aatagactaa ctgaaaagtt tgggagaaaa 

10 

actcccacat atttctgata atctagaagg 
tcagaaaaga cctgaaaaca tcctaaggtc 
15 aagtaggaag taaaggataa gcagagtggt 
atggtgtgag aaaagggtcc aatttcatta 
accatttatt gaagaggctg tcctttctcc 

20 

tctgttgacc ataaatgtgc aggtttattt 
tgagtctgtt ttcatggcaa taccatgttg 
25 caagattata tagtgtgatg acttcagctt 
gttggggtct tttgtgattc cacacaaatt 
agattgaatt ttgatagaga ttgcattgaa 

30 

ttcaacaata ttaactcatc caatccatga 
ttcttccatt ccttttatca atggtttgta 
35 ggttaaattt gtttctacat attttttata 
attttttgga tagttcattg ttagtacaaa 
tttctttttc tttttttctt tctttttttt 

40 

tgtcacccca ggctggaatg cagtggcatg 
ggttcaagtg attcttatgc ctcagcctcc 
45 catgcccagc taatttttgt atttttagta 
ggtctcgaac tcctgacctc aagcgatcca 
atagacatga gccaccacat ccagccgcat 

50 

tttgcttatt agttctaata gttttttggt 
catgtcatca tgtcta'tggg gttttctata 
55 tgtgtccagg aagtggaaca tggagtcaag 
attctttacc cagttgagtt tttctttgtt 
tatcctatgc ctgtcacact attgtgtttt 

60 

cccacagctg gagggaaact tatctcagga 
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aaagcaaatc 


ttatcctcaa 


agaactgcag 


8400 


agagaaactg ctcagagaac 


ctgcctttac 


8460 


aggtaagcag tggcatttgt 


caaaaacatg 


8520 


ctgaggcaag 


gaataatggt 


tgggacaaat 


8580 


ggccagggaa 


tgagatgctt 


tgagtaacaa 


8640 


ccacagacat 


gccctgggct 


agacacatgc 


8700 


tcaccttcgg ataatgttca 


gactctgcac 


8760 


ctttaatgca 


ttttgagttg 


atttttgtat 


8820 


ttctgcatgt 


ggatacccag 


ttgtctcaac 


8880 


attgtgtgct 


tttggcatct 


ttgccaaaaa 


8940 


gtgagttctt 


cagtctgttc 


cattggtcac 


9000 


ttttgattat 


tatagctttg 


taatatattt 


9060 


tgttcatttt 


gctgaagatt 


gctttggcta 


9120 


tcagattttt 


tctctatttc 


tgtgagaaat 


9180 


tccatagact 


gatttgggta 


gtatggatat 


9240 


acatgaaata gcttttcatt 


tatttgtgtt 


9300 


gttttcagta 


tataaatcat 


ttaactcctt 


9360 


ctactgtaac 


tgggtttgtt 


tccttaattt 


9420 


gaaaaccttc 


tgattcttgc 


atgttgattt 


9480 


tttttttttt 


tttgagatgg 


agtcttgttc 


9540 


atcttagctc 


actgcaacct 


ccacctccca 


9600 


tgagtagctg tgattacaga 


tgtgtgccac 


9660 


gagacaaggt 


ttcaccatgt 


tggccaggct 


9720 


cctgccttgg 


cctcccagag 


tgctgggatt 


9780 


gttggttttt 


aaccttgaaa 


ccttactgaa 


9840 


ggaatctatg ggattttcta 


tatataggat 


9900 


cataggattc 


atttcctatc 


tggatgccaa 


9960 


caagattatt 


ctgtagactt 


gagatttaac 


10020 


gctatttctc 


ccttttagaa 


tgaaaatatc 


10080 


ggaagcacaa 


aacttattcg 


atttcacagg 


10140 


tgaatcatgc 


cttgaatctc 


atgcctatat 


10200 
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ctgatttaga 


tgatactctt 


gactttggac 


ttttgagctg 


ttgctggaac aagttaagac 


10260 


tttgggtctc 


tagggataga 


atgactgcat 


tttgtatatt 


aggacatatg ttttggggga 


10320 


ctatgggata 


aatgctatgg 


tttgaaggtt 


tgtcccctcc 


caaatcgtgt tgaaatttag 


10380 


ttgccgtttt 


atcaatatta 


agaggcaaga 


cctttaagag 


tgattcatcc atgatgtctc 


10440 


tgctctcatg 


aatggattaa 


tgccttatca 


ctggagtggc 


tttgttattg agggaatcgg 


10500 


ttcattataa 


aaggaagagt 


ttggtcctct 


tttgcctctc 


tatctctcac catctctttg 


10560 


cctttccacc 


atgggatgac 


acaacagtaa 


gacccttgcc 


agatgccaac tcctcaatct 


10620 


tagacttccc 


agcttccaaa 


actttgagcc 


aataaattac 


acagtctgtg gtattctatt 


10680 


acagcacaaa 


atggactaat 


acaacttgag 


aggagcctaa 


tatatgatgt atcctggaga 


10740 


atgttccatt 


agtacttgag 


aagaatgtgt 


attctgctac 


tattagaaga aatgttcttt 


10800 


atttgtatat 


taggtctatt 


aggtctacag 


tgtagttcaa 


gtccaggatt tccttaatga 


10860 


ttttctgtct 


ggatgatctg 


tccattgctg 


aaagtgggat 


attgaagttg cctactatta 


10920 


ttgaattgca 


gtttattgct 


cccttcatat 


ctattaatat 


tatatatatg cacacattta 


10980 


tatggtgctc 


caatgtcggg 


tgcatatata 


tttacaattg 


ttatgtcctt ttgatgaatt 


11040 


gaccccttta 


tcactatata 


attactttgt 


ctctttttac 


gattttttac ttaaagtcta 


11100 


tcttgacaca 


tataagatag 


ctacccatgc 


actggtctcc 


ctttgtgtgg aatatctttt 


11160 


tccatccctt 


cacttttagt 


ctaagagtgt 


ctttaaaggt 


gaagtgagtc tcttataggt 


11220 


tccatagagt 


tgggtcttat 


ttgtttattc 


attcaaccac 


tctgtctttt gattggagaa 


11280 


tttaatctat 


ttacatttaa 


ggtaatgttt 


gacaagtgaa 


gagttagtat tgccattttg 


11340 


gtatttgttt 


actgactgtt 


ttgtagttcc 


tttgttcttc 


caatcttgct ggtttccttt 


11400 


gtgatttgtt 


gaatttttgt 


agtgatatgc 


tttgattttt 


tttttatctt tttcacatct 


11460 


gctataggct 


ttctttgttt 


tgttttgggt 


tttttttggt 


ttttctaata ctttaagttc 


11520 


tagggtacat 


gtgcacaacg 


tgcaagtttg 


ttacataggt 


acacatgtgc catgttggtt 


11580 


tgctgcaccg 


atcaactcat 


catttacatt 


aggtatttct 


cctaatgcta ttcctccccc 


11640 


agccccctga 


caggccctgg 


tgtgtgatgt 


tccccaccct 


gtgtccacgt gttctagttg 


11700 


ttcaactctc 


acctatgagt 


gagaacatga 


ggtgtttagt 


tttctgtcct tgtgatagtt 


11760 


tgctgagaat 


gatggtttcc 


agcttcatcc 


atgtcctggc 


aaaggacatg aacttatcct 


11820 


tttttatggc 


tgcatagtat 


tccacggtgt 


atatgtgcca 


cattttctta atccagtcta 


11880 


tcattgatgg 


acatttgggt 


tggttccaag 


tctctgccat 


tgtgaatagt gccacaataa 


11940 


acatacgtgt 


gcatgtgtct 


ttacagtagc 


atgatttata 


atcctttggg tgtataccag 


12000 
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tagtgggatt gctgggtcaa atggtatttc 
tgtcttccat aatggttgaa ctagtttaca 
5 ttgtccacat cttctccagc acctgttcgt 
ctggtgtgag atggtacctc attgtggttt 
atgagcattt tttcatgtgt ctgttggctg 

10 

tcatatcctt tgcccacttt ttgatggggt 
tttgtagaat ctggatatta agccctttgt 
15 attctgtagg ttgctgctat aggctttctt 
tcttgtagat atacctattt taagctgatg 
ctacactttt acaccccata ccacatttta 

20 

aaagacgaaa ataatgggag tgggtcagct 
taaggtcatt acagaagaat aaaatctaag 
25 ttgctgttta gcagagaata gtgtcagtgg 
gctaagcagt ggaaaatcag aggcttccat 
gtattacttg ttctctatat ttctctatag 

30 

tatacaatat ctctgttaaa atgctgagaa 
tgggggtaag gactaagtgc tacttgggtt 
35 agaaaaagga aaagacattg accacatctt 
cctaaggtga tgataactag cctaaagcta 
acattccgga atagttcaac gttaccacaa 

40 

tgtttgctga tatatgagag gtccattttc 
gctcatcaat cataaagtgg tcaggtaatg 
45 ggggcactgt actgtcaaca tgtgcccctt 
tcaatgatct cccgctcagc tctcctcaat 
aatatttgtg ggaccttgaa tcacatttgc 

50 

aggaaaattc tcaacgtatc aaatatatta 
acactatcag aatgcagggc atacctacat 
55 taacgatgtt agctagggtg atcaatccag 
acaatgaata gaggcaaaaa tgaatatgtc 
ttagtactgt catgccccat gattaaggtc 

60 

aagcctatga atggcccaga ctcttcagga 
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tagttctaga 


tccctgagga 


atcgccacac 


12060 


gtcccaccaa 


cagtgtaaaa gtggtcctat 


12120 


ttcctgactt 


ttttatgaac 


accattctaa 


12180 


tgattcgcat 


ttctctgatg 


aacagtgatg 


12240 


cataaatgtc 


ttcttttgag 


aagtgtctgt 


12300 


tttttttttc 


ttgtaaattg 


gtttaagttc 


12360 


cagatgggta 


gattgcaaag 


attttctccc 


12420 


ttgtatttac 


catgaggctt 


aaataaaaaa 


12480 


acaacccaac 


ctcacttgca 


tacaaaacct 


12540 


catttttctg 


ttacagttta 


cttctctagg 


12600 


acatcaaaag 


gtatctactc 


agtctcattt 


12660 


aggtagcttt 


tggcttttgc 


caggaagtta 


12720 


gttgatgcaa 


aaagcttttt 


gcaaggaatt 


12780 


ttcaagtgcc 


ttatctcaaa 


ataatcatta 


12840 


caaaggagtc 


cattcatcct 


ctataactgc 


12900 


gatttatcct 


aaaagaaggc 


accaaagcaa 


12960 


aataaatgac 


tagtgttgct 


tcttcctaat 


13020 


ccccaaagga 


aggagaattg 


acaaaagtgt 


13080 


attggactca 


aatgtgacgg 


acccaaaaaa 


13140 


tccacccagg 


gttctaacaa 


atacattctg 


13200 


tgcctgtgtc 


tgtaaacagt 


ttgaatcaaa 


13260 


ggcttaaaac 


cagaaaaggc 


tgtagaccct 


13320 


acattcagct 


caggtgggtc 


acattctgct 


13380 


cttagaatac 


tatacccttt 


tgtattatcc 


13440 


atttaagtca 


gcaacattaa 


accttgcact 


13500 


ttatctccag 


ctggttgaga cctacccagt 


13560 


gcctctgagt 


caacaggcaa 


aaatgagagt 


13620 


aatqccaaqq 


ggaaattgga 


ccactacttc 


13680 


tggaatacag 


gagatccttt 


aagatgtctc 


13740 


aatgggaaat 


tacaacaacc 


caatccaggc 


13800 


ttaaaggttt 


gggttatcct gtcaggtaaa 


13860 
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gaaacatgac cagccaaggt tgcataatac 
gcactgtagc ccctttctga ggcattcctg 

5 

ggacagaact tcaagcagta cacgaggttg 
gtgtgtgact atatactaat tcatgggctg 
10 cttgaaacga atatgattag aaaagtggtg 
tagacttctg aataggcaaa acacataaag 
agagcatgat tctcctgcct cagcctccca 

15 

tgcccagcta atttttgtat ttttagtaga 
tcttgaactc ctgacttcag gaaatccacc 
20 aggcatgagc caccatgccc agccagcaga 
ggcccattct gtggatacca ctcagcctct 
gcccattaac aaagtggcca tggtggcagt 

25 

ccacatgggc ttccactcac caaggctgac 
tgccagcagc agagatgaac attaggttcc 
30 cctggtggca gaatggctac attggaccat 
ttactggaat agatacttat tctagatatg 
caaaaactac catccatgga cttacagaat 

35 

agcattgttt ttgatcagcg aactcacttc 
gctcatgtaa tttgctggtc ttactatgtt 
40 ggatgatgga atgaactttt gaagacaatt 
taactggggg aaggttcact agcagagtgt 
aaatcaggga gtgaaaatgg gagtgacacc 

45 

aaaaatttgc ttcctgtcct gaaacattat 
ggaagaaatg cttctaccag gagacataat 
50 cccaactagc cattttggac tttttatctc 
tgctggaagg ggttatcgat actgactacc 
taaggaagtg tatattagta atacaggaag 

55 

cctgtgatta aggtcaatgg aaaactacaa 
ccagactctt caggagtgat ggtttgggtc 
60 gaggtgcttg ctgaaagcaa aggaaatatg 



ccaggcacaa 


ccccaaagta 


gacagctgta 


13920 


aagaaaataa 


tgaagaaaaa 


tcttcccagt 


13980 


tatattttgc 


ttggaaggag 


aaatagccgg 


14040 


tggccaatgg 


tgtggctgga 


atgtcaggga 


14100 


acataggagt 


ctagagaaca 


ggtttgtgga 


14160 


atatttgtgt 


cccatgttaa 


tgtccaccaa 


14220 


agtagctggg 


attacaggtg 


cacaccacta 


14280 


gatggggttt 


caccatgttg 


gccaggctag 


14340 


cgcctcggcc 


tcccaaagtg 


ctgggattac 


14400 


ggaagatctt 


aataatcaag 


tagataggat 


14460 


ttcctcagcc 


accccatcat 


tgtccaatgg 


14520 


ggcagggatg 


gaggttatgc 


atgagctcag 


14580 


ttggctacag 


tcactgatag 


gtgcccaaaa 


14640 


tgatatggta 


ctattccctg 


gggtgattag 


14700 


ttctgtcatg 


gagagggcag 


tgttttcttc 


14760 


gttttacctt 


ccctgcattc 


aatgcctctg 


14820 


gccatatcta 


ttgccatggt 


aatgccacac 


14880 


agagcaaatg 


aagtgcaaca 


ataagctcaa 


14940 


ccccaccatt 


atgaagcagc 


tggtttgaca 


15000 


acagcaccag 


ctagctggca 


atatcctgca 


15060 


ttctcccaca 


gctaggattc 


acaggtccag 


15120 


actcactatt 


acccatagtt 


acctactagc 


15180 


actgtgcttg 


tcagatgtct 


ttattccaat 


15240 


aatgatttca 


ttgaactgga 


agttaggact 


15300 


tgaatcagtt 


ggcaaagaag 


ggagttattg 


15360 


ataaggaaat 


ttggcctatt 


acttcacagt 


15420 


ccccctaggg 


tatcctcagt 


attaccatgt 


15480 


caacccaatt 


agaacaggac 


tactgatggc 


15540 


aacccaccaa 


gtcaataatc 


acagcctgct 


15600 


gaatgggtag 


tgggagaaaa 


taatcataaa 


15660 
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taccagctat aaccatgtgg ccagttacag 
ttccttattt tgttatgaac atgtttgtgt 
5 tccctctctt atccccttat tatgtaacat 
aagttgtggt atttcaacag aagagtaaac 
aaaagtgtta gtgcattttt tgttgtatgc 

10 

ctttggagat taagtatggt ttcacgagat 
gtggacatga agttgacttg tgatgggtaa 
15 atgcctagat agttgttcaa acattattct 
cattaacatt taagtcgctg gtctttgagt 
attttattca atcagttgaa ggtctgagtg 

20 

tccctcatcc ctcccctcat ggagcaagag 
atctgcacca tcagctcttc cgtatctcca 
25 ttgccagcca aaatttacag agtacatgta 
caagtctaaa tcattttaag atgtttgttt 
cccagaacca cagggaaaat tgtgacccat 

30 

gtattcaaag gcaaaagcgc aggtatgagg 
ctagataaga ggcaagcagt tgcatcctct 
35 acaatgtacg catgacaggg gggtacagga 
aaatctgcct tttcacataa gataaaataa 
atttgtctca ggtgagcaga gagatgactg 

40 

ataaggtcct aagatttatt ttccttttac 
tatataaaat atatagggtc tgtttctctg 
45 tctcaatacc agaacagaaa cccatatatt 
attgaagtag ctttaattca cttaaaacta 
ccaggggata aagtagttgc agggcagatg 

50 

ttcggtggtt gcacacatcc tagtcactgg 
aataaagccc agctccaagc aggccccaga 
55 caggagtcac tagagggcct gcttgcctgc 
agattggtta caaaggtcag cactggttgc 
tccaagtcac aaacaagaga ggatgaggga 

60 

gattcacagt ggagaacaag agctgagtct 
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aaataaggac 


tgtaattgtc 


atgagtattt 


15720 


gtttgtgtgt 


attacatacc 


tttgttttct 


15780 


aagatgtgtt 


gactttatgt 


tatggctttt 


15840 


gtcactcaag 


aattttgcat 


cctcttctgg 


15900 


atgacagctg 


ttaggcaaaa 


gtacatcttt 


15960 


gtgtaagtgg 


actgctaagt 


ggactgctaa 


16020 


ttttatgtgt 


caacttatct 


ctgtcaaaat 


16080 


ggctgttcct 


gtgagggtgt 


ttgtggatga 


16140 


aaagcagatt 


gcccttcata 


atgtgggtag 


16200 


aaacaaaagg 


ctgaccttcc 


cccctccatt 


16260 


gaattttcca 


gaagactgcc 


ttcaagcttc 


16320 


gcctgctggc 


ccacactata 


aattttggac 


16380 


cagatacaga 


gtaaatcaaa 


acgactgagg 


16440 


gccaaggtta 


aggacatgcc 


tgagaaagaa 


16500 


actttttctg 


aagagggtct 


ggagacctca 


16560 


aaagaggaag 


aaatttttta 


aagggtgtga 


16620 


tgagtgtttc 


atcagccatt 


ccctgaatac 


16680 


atagttactt 


atacatttgt 


ctagtccaat 


16740 


acatagggca 


gaggaagcaa 


tcagatatgc 


16800 


agttctgtcc 


ttcgtcccac 


acctgtgaag 


16860 


taacatatat 


attatacaca 


catatatgta 


16920 


gagaactctg 


aataatatac 


ttttacataa 


16980 


ccctatgaac 


atgaatttta 


aggtaatctg 


17040 


cctaaataga 


tcagaaagaa 


agaatgggca 


17100 


atgttgggtt 


ttaatctcca 


atacggcccc 


17160 


tgtgcgcagg 


tggctctact 


ccggctggaa 


17220 


gaaccacttc 


acataagagc 


tatagaaaac 


17280 


ttttttaaac 


aaaatctaaa 


actatoatcc 


17340 


aggagcaggc 


agagcagaga 


gaaactgagg 


17400 


agtaaggcaa 


gatcaaaagt 


tcctatttgg 


17460 


acaggctgga 


gctggcctca 


gagaaccaac 


17520 
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catgggacgg agacaaatat caaggagaac 
ttttgttgtg tgtcctgggc caaacaaata 

5 

ctagaaaaat aaatattagt tgtctctaaa 
tgtttaaata catgaggaga ccgcaccacc 
10 tgctttctct ctgacacaca attctttcac 
acattctcct aacctctcag tcacccagac 
taataatgtt ggctctctct tctcacaggc 

15 

tgcactggga actctctgaa ttcattggat 
tcctatccct ataattgctg tagtaattta 
20 accatggtat tattgctttc aattcattta 
atgggctaaa gacccaatct ctaacttaaa 
gaaatgcctt atgtttgttg tatattttaa 

25 

tggaaaatat attaaaggag gatgcttgaa 
ctacaaggct tatcttttgc agaatgaaaa 
30 tacagaggca atttttagag agcatgtata 
cacaggagga atttggggaa attaaacaga 
tattataata attccatggt agcagttgcc 

35 

agaagcagaa agtttaccca aggttaaatt 
tacccacaac tttgtacttt tcaaccaaaa 
40 taacacattc ccttttgctt tgtgaaaagg 
cccagagaag ggcctggtga tctctgctct 
tcttatagtg ggagtcctca agtggaagac 

45 

agtgtatgta ataaagacaa atcatttaaa 
aatcatctag aaagaatcct cagcccaaga 
50 gccatagata ataattactc aattttatac 
ctccaagttc attttacctt aattgccacg 
ccatttgaca aacagggaaa ctgaggcttg 

55 

cagcaatgaa gtgtcagagg taaaaacaaa 
tgacaggatt tcattttcct gggctgaagg 
60 tcatcttttg ttattgccgc aggcagattg 



cttaaggatc 


ctcagagcca 


acagctctcc 


17580 


ggcctctgct 


ttttacttaa 


ctagaaccaa 


17640 


ggttgctatg 


aaacaaaagc 


aaatactccc 


17700 


taactgtggc 


tgccatttct 


ctattcagcc 


17760 


tactgcttca 


gaagtaccag 


ggtaaagcaa 


17820 


tgtaatcatc 


tgtggtagat 


ttgacttgag 


17880 


tgtgcaaata 


atgattggcc 


tgattcacag 


17940 


tttgctaaga 


gaggattcaa 


gtggttctgg 


18000 


ctcatttttg 


tcattatttt 


ttgtgagtaa 


18060 


tttttttcta 


ttgaattcag 


gtgtctagaa 


18120 


ggagcccatt 


tttgtgtgaa 


tacagactta 


18180 


taattttctt 


gagaggttga 


ggtgcctata 


18240 


gaataataaa 


gagagtggct 


atttgggtaa 


18300 


atccaaaggc 


cttagcttaa 


tgtctggggc 


18360 


cagaaaaaaa 


aaaaaaagga 


gacctagagg 


18420 


gatttctaaa 


gtagatgttc 


tgaataaggg 


18480 


aaaactccca 


ccaagggcat 


gcccaattac 


18540 


tctccataga 


agagagtttt 


attacctcct 


18600 


acagggcttc 


aagggagaga 


agatttagta 


18660 


cctctgagac 


atccctgtac 


agtcatttca 


18720 


ggataaaaga 


cattattttc 


tctaaggctt 


18780 


atttacaaac 


accgagaact 


ttctggaaag 


18840 


ctgaaataat 


acaattaatc 


acttattgca 


18900 


tatgaaagag 


gagcatatga 


cagatattta 


18960 


ctattatgta 


tcaggcatct 


ccaaaaggca 


19020 


acaaccccat 


gagatagcta 


tcattcatct 


19080 


gagaacttaa 


gtgactctca 


cagagtgaca 


19140 


caaacacaag 


ctaagatgat 


tctgccttca 


19200 


atccccagtc 


cccttcttag 


tatgtttgaa 


19260 


aagacaggta 


tatgatttac 


tggaaaaaaa 


19320 
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ttaattcttg gtattccaga aacaacaatc 
cacttagggt atgatatttg tggaaggcaa 
5 aaactattcc cactgatgaa atttttttta 
agtgagagcc accacactag ccactaggaa 
aataaatgac aattaaaaca agtgaagtgc 

10 

gctccctaag tcccatcacc tatgcaaagg 
ttttcatccc tataacagag aaatagtgaa 
15 ctacagactt cgaatggggt ttttcatgtt 
gccactttgg atattaggaa tgccactggc 
cataaggcat tgtgtgttta taaatgctta 

20 

cagtagtgta atacagaagc ctgttacgag 
ctctaaagat taaaatgtta cttggtgcaa 
25 aacatttttg gggaaaaaaa acttttccaa 
ttgtaaagat atcctccagt gcatcctctg 
agcaataata ttgtatacag agacagacgt 

30 

ctatctcaac gagcatatat aaccaaataa 
aggctggtgt gcagtgggtg atcacagctc 
35 atcctccagc ctctgcctcc caaatagctg 
tttttcattt tttgcagaga taagggtctt 
tggcctcaag gcatactcct gccttggtct 

40 

actaaaccta gccaccaaac aaaacttcga 
aaatgtaatt aggataagat tagaaaatta 
45 aataatgtgt gaaaagagaa ttgccttcag 
cacaaaccag atatcatgac aacacatcta 
tccctcctgt ctctcacagc cccccttgtg 

50 

ctgaattttc caattctgac tgccttacta 
actctgtgcc tcactctcct catttaaaaa 
55 caaagggttg ctgtgaggat taaattagta 
cacagggtgg ctcatgccta taatctcagc 
ctgaagtcgc gtgttcgaga ccagcctggt 

60 

atacaaaaat tagctgggtg tgatggcgca 



PCT/US01/48437 

ctttcaatta gtgccagaag gtgatgatag 19380 

cctataaatc ttcagatgta atgttttctg 19440 

taacattatg ggagaaacgt agttcttgca 19500 

atattttcta ataattcatt tcaagggatc 19560 

ctacaacttg tcctcataaa atcattgtga 19620 

tctcattaat gaccggagac tttaaaacat 19680 

tccatgcttt agaaattgta ctagcggtca 19740 

aaaatcactt tcttcagaaa tgcattcagt 19800 

tctagggaaa gaagctattg aggggattat 19860 

tttctatccc cagttcatca actcaggatc 19920 

atataaggta tgtcagaata ttcggagcaa 19980 

acaaacaaaa aaagaaatta ttgctgggcc 20040 

atgtttaaat gaaaaaaaat tatttaattt 20100 

tgcatgaatg ttggaggagg atacagtgtg 20160 

gagagtttcc atgaaataaa tcatttcaag 20220 

aacttaagac aaggccttac tctgtcaccc 20280 

aatgcaacct ggagctcctg ggctcaagtg 20340 

ggactgcagg tgtgtgccac cacacctggt 20400 

gctatgtttc ccaggctggt tttggactcc 20460 

ccgaaagtgc agggattata gacatgagcc 20520 

ccctgagaat caaagcattg ggaagcgaag 20580 

gtgtctagtt aaggagaatg gataagacaa 20640 

caaaaccagg tcaggaaata tcaattagaa 20700 

acgaaggatg gtcctttttc ttttagaact 20760 

attaggatca cagctctgga ggcagatagc 20820 

gtgcctgtgt aaccctgagc aaattacttt 20880 

ataaaatgtg gttaataata gtaactgcct 20940 

tgtaagcata aagaacagta cctggggcca 21000 

cctttgggaa gccaaaggca ggtggatcag 21060 

caacatggca aagccccgtc tctactaaaa 21120 

cgtctgtaat cccagctact caggaggctg 21180 
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aggcacgaga 


ategcttgaa 


cctgggaggc 


aaaggttgcc 


gtgagctgag 


atcctgccac 


21240 


5 


tgctctccag 


cctgagtgac 


agagcaagtc 


tccatctcaa 


gaaaaaagaa 


cagtaactgg 


21300 




cacaaagata 


ccactctgct 


tatcaaaacc 


.cgctgagaaa 


aaacaggtgg 


attctcctct 


21360 




ttccccaggg 


gaaaatctgc 


tttcagggac 


tggtaagaga 


agaaattgaa 


attgtatttc 


21420 


10 


atatttcaga 


taaaggggtc 


ctcctatttt 


ttctttcagt 


agaatagggt 


cagagtggta 


21480 




tgagaggeca 


tttttctcag 


ccccagattg 


gaagctataa 


ttatgggcat 


taagctatta 


21540 


15 


cacttttttt 


aacttctatt 


cccaaggtag 


ccactgttaa 


tatcattttt 


cattctaggc 


21600 


ctttttctaa 


gctttttttt 


tttttttttt 


tttgtaattt 


gtttgeaage 


agagtgaaca 


21660 




tgtgacttag 


aagtcacctg 


gtgttgcttt 


tecaeggaaa 


accttacatc 


cacagagata 


21720 


20 


taagtacaga 


tgacatgaaa 


geccataaag 


cccccatgtg 


atttggtaaa 


ctcccagaac 


21780 




tagcattctc 


atagtttagt 


ttctgtattg 


gtgcatactg 


tcccaaattt 


tctccacaac 


21840 


25 


acccaaagaa 


acagttgtga 


ataaactcag 


ccccttgact 


gattttcaca 


aaaactcaca 


21900 


aaaagttttg 


ggttttttta 


ctcgaggatt 


ggtgggggaa 


gattatgatt 


gtaatttcca 


21960 




agttccagtc 


acaacctaga 


tccctttgta 


actctctccc 


taatccccat 


cactccgtct 


22020 


30 


tctgcccttc 


tttttccatc 


tccccactcc 


acctcaatca 


atcaatctat 


acatatacat 


22080 




atatgtgtgt 


gtgtgtgtgt 


gtgtgtgtgt 


gtagatatat 


atatatataa 


tttgetgeat 


22140 


35 


gtcaggcatg 


ttctatgtat 


gtaggatgea 


tcagtgaaca 


aacagatgaa 


aatttattac 


22200 


ccttagggag 


cttacattct 


aatagcagga 


taaaagcaat 


aacaataaac 


aggacaaaca 


22260 




gggaaataat 


ataatatgtt 


agaaggtgat 


aggtgtgatg 


aaaaaaagaa 


aacgtaataa 


22320 


40 


gggctgecat 


agtttcgtct 


tgaacatttg 


gaeggattea 


gttgagaatc 


ttagacatga 


22380 




agatatttaa 


agecaggaga 


cttgttgaga 


tctccagtgg 


aaagaacaca 


acaaagacga 


22440 


45 


aaagagaaac 


aagaactgag 


ccctagggca 


ctccaacatg 


aaaaatagaa 


aaaaggggag 


22500 


aaaataataa 


tttttttact 


tgaaaagttt 


cttggtttaa 


tcatgttcta 


aagagtaaaa 


22560 




tattatatct 


tattcgtgtg 


catgttttca 


agttaaagca 


gaaaataaat 


gtagaatata 


22620 


50 


tgtttttgtt 


tgaatttttt 


tttttgtttt 


ttgcttgttt 


gtttgttttt 


tgagacagaa 


22680 




tettgetctg 


ttacccaggc 


tggagtgcag 


tggtgcagtc 


tcagctcact 


gcaacctctg 


22740 


55 


cctcccaggt 


acaagcaatc 


ctcccaactc 


agcctcccaa 


gtagctggga 


ctacaggctc 


22800 


ccaccaccac 


gccctgctaa 


tttttgtatt 


tttagtggag 


acggtgtttc 


atcatattgg 


22860 




ccaggctggt 


ctcaaactcc 


tgacctcaag 


tgatctaccc 


acttcagcct 


cccaaaatgc 


22920 


60 


tggaattaca 


gtcattagee 


accgtgcccc 


accagaatat 


atgtcttttg 


taccataatt 


22980 
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tttaaattgt tttttcaaac aatatacatt aagtgacttc cttttggggt gcagatattt 23040 

gagttcagga acttttttct gaaaacatat atatttattt tcatgtaaac ctgggaaaat 23100 

5 ctttagctgt ctatgtaagt attatcttaa ttgattcttt tcttaagcat acaactcaat 23160 

agtttttagt atattcactg atatgtgcaa ttattaacac agtccattca aaaacatttt 23220 

catcacctca aaaagcaact ctgtacccct tggctatcac tcccatcccc aacttcccat 23280 

10 

tctccccaac cttaggcagc cactaatcta catagggtct ccatatattt tcctactcca 23340 

gacatttcat taaaaatgaa tcacatcata tgtaatttct tgttaactgg cttatttcac 23400 

15 tcaatatatt gtggtaaaag ttcactcatg ttgtagcata tatcagtact ttattctttt 23460 

tatgactaaa catttctata caccataaca ttcaagctga gagtcaaatc aagaatgcaa 23520 

ttgcatttac aatagccaca aaagtaataa aatacctaga aatacatcta accaaggagg 23580 

20 

tgaaagatct ctacaaggaa aactagttaa tacaaaatat tgcttaaaga aatcatatat 23640 

gacacaaaca aatggaaaaa cattttatac tcatggatta gaagaatcaa aatcattaaa 2 3700 

25 atggccatac tgcgcaaagc aagctacaga ttcaatgcta tttctatcaa accaccaatt 23760 

atcattttta gcacaattag aaaaagtatt tctaaaattc atatggaatc aaaaaagaac 23820 

ccaaatagca aaagcaatct taatcaaaag gaataaagcc agaagcatcc acattacctg 23 880 

30 

acttcaaagt atactacaag ggtatagtaa ctaaccagta ccatgctggt actggtgcaa 2394 0 

aaacaggcac atagatcagt ggaaccaaat agagaatttg gaaatacatc tgcacaccca 24000 

35 aaaccaactt atattcaaca aagtcaacaa aaataagcaa ttgggtaagg attccccatt 24060 

cagtaatggt gctgggataa gtggccatcc atatacagaa taataaaacc gaaccctcaa 2412 0 

aagcaattgc aacaaaaaca aaaattgaca aatgggacct aattaaacta aagactttct 24180 

40 

gcacagcaaa aaaaagaaac tattaaacta aacagacaac ctgcagaatg ggagaaaata 24240 

tttgcaaact atacgtccta caaaggacaa atactcagaa tctataagta acttaaacaa 24300 

45 ttcaacaagc aaaaaacaaa taaccaaatt aaaaagtggg caaatgacat gaacagtcac 24360 

ttcttaaaaa gaagacatac aagcagccaa caaacatgaa aaatgcttaa tatcactaat 24420 

tgtcaggaaa atgtaaacca aaaccaaatg agataacatc tcataccagt cagaatggct 24480 

50 

attactaaaa agtcaaaaaa tagatattgg tgagaatgcg gacaaaagga acacttatac 24540 

actgctggtg gaaaagtaaa ttagctcagt ccctatggaa agcatctgga aatttctaaa 24600 

55 gaactaaaaa taaaactacc acttgaccca gcaatcccag tgcgcgtgtg cgccccnggg 24660 

acacngngnc ncncncncnc ncncncncng ngngngnggg tgcgtcgtcc cactctntnn 24720 

caaagacacg cgctcactcc cggtgcccct ctctggcgat atgggtatag aaaaagagat 24780 

60 

atatctctat atatacatat atatatatat atatatatat atatatatat atatatatat 24840 
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atatgcatac catggaatac tacacagcca 
caatatggat gcagctggag gccattgtcc 

5 

aatgccagtg gggttataat aagtgggagc 
aagaaaatag acattgatga atgctactgg 
10 actacctctt gggtactaac tgggtgatgg 
aaaatacacc caaataacaa aactgtacat 
ttttttttaa agaacgtgaa tagatattgt 

15 

ctgcacaccc acaaccaact ggtattcaac 
gagtccctat tcaataatgg tgctgggatc 
20 tggaccctct gaatgccttt tgggttttct 
taatacaagg ttgaatagat ataatgagag 
cattgagtct ttcaccaata ggtgtgttag 

25 

ggttgagaaa gatcccatct attttgagta 
gaattttcac caaatgcttt gtctacatca 
30 tttactgata tggcatatga cattaattgt 
ataaatcaca cttgcttaca gtgtacaagc 
gcacctcaca gataatgagg ttttttacaa 

35 

caaagctgtt tcttcaacat cacgtgctca 
tctcacaata tttcaaactt ttttatgatt 
40 tgacctttat gattattata tttgtcatag 
cattgttatt gttttggcat accataaact 
taaatatgtg cgttctgact gctctaccaa 

45 

ttgggcttct ctattttcta atatacaaca 
gtcatagtga tctgtgatca gtgaccttta 
50 atcggtgacc tttgatatta acattgttat 
taagataaca aatttaattg ataaatatgt 
cccctgtctc tctccatctc cttgggcttc 

55 

ttaggccaat taataaccat acaatggcct 
atctctctct taagtcaaaa gttataaatg 
60 gtgagatagg ccaaagttaa tcctcttgca 



taaaaaagag 


tgaaatcatg 


tcctttgcag 


24900 


taggtgaatt 


aatgcagaaa 


cagaaaacca 


24960 


aaaacgttgg 


gtacatatgc 


acataaagat 


25020 


gggaaggcag 


gagggaagaa 


agttttaaaa 


25080 


gatcaatcat 


accccaaacc 


tcagcattat 


25140 


atgccccctg 


aatctaaaat 


gaaagttgaa 


25200 


tttatttcta 


tagtttattc 


agaaataaat 


25260 


aaagtcaaca 


aaaataagca 


atggggtaag 


25320 


attggccatc 


catacacaga 


ataataaaac 


25380 


ttactgccga 


actgccctgg 


ttagaatttc 


25440 


tatatctcac 


cgtcttattc 


ctcatgaaag 


25500 


gtgtggattt 


ttattagatg 


ctccttatca 


25560 


ttttgagtgt 


ttttatcata 


aaacagtgaa 


25620 


attgagatga 


tcatatgtgt 


gtttttctat 


25680 


ataagataaa 


accagccttg 


catttgtgga 


25740 


atacctcgtt 


ttattgtgct 


ttgctatatt 


25800 


agtgaaggtt 


tataataaca 


ctgcatcaag 


25860 


cttcatgtct 


ccgtgtcacc 


ttttagtaat 


25920 


attatatttg 


tcatagtgat 


ctgtgatcag 


25980 


tgatctgtga 


ttggtgacct 


tcgatattaa 


26040 


gtgtctatat 


aagacaacaa 


atttaattga 


26100 


ccacacttac 


ccttgtctct 


ctccatctcc 


26160 


acattgaaat 


taggccaatt 


aattatattt 


26220 


tgattattat 


atttgtcata 


gtgatctgtg 


26280 


tgttttggca 


taccataaac 


tgtgtctata 


26340 


gtgttctgac 


tgctctacca 


accatcctta 


26400 


tctattttct 


gagatacaac 


aacattgaaa 


26460 


ctaaatttta 


agtgaaaaga 


agagccacac 


26520 


attaagctta 


tgaggaaggc 


atatagaaag 


26580 


cccaaccatt 


aacataggtg 


tgaatgtaaa 


26640 
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ggaaaaattc ttgaaggaaa ttaaaagtgc 
acaaaacaag cttgttgctg atgtggagaa 
5 ctacagcatt tccttaagcc aaagcctaat 
attctgtgaa gaccgagaga gatgaagaag 
ttgattcatg aggtttaagg aaagaagttg 

10 

agcaagtgat aatgtaggag ctgcagcaag 
tgaaggtggc tgtattaaac aacagatttt 
15 gaagacgtct tataggactt tcccagctag 
acaggctgac tctcttgtta gaggctaatg 
tcatttacta tactgaaaat tctagggccc 

20 

gtaaatggca taagaaagca tggataacag 
attttaaatt caatattaag aactactact 
25 ctgctcattg acagtgcacc tggtcacatg 
tttgcagtga tgaatcaagg agtaatttgt 
ttttgtaaga ctacagctgt catagctagt 

30 

ttgaagagct tctggaagat tcaccattct 
ggaggaggtc aagatagcaa cattagcagg 
35 gaataacttt gaggggttca agacctcggc 
agcaagaagg gaagcctgaa gatgtgactg 
tggataagga gttgcttctt atggatgagt 

40 

ctccagtgaa gatgctatga acattgctga 
aagcttagtt gataaagcac tgtcaaggtt 
45 ttgtatggtg gataaaatgc tatcaaaaaa 
aaagaagagt tgatgggtgc agcaaacttc 
ggaccccaac cttcagtaaa caccacccta 

50 

gaccctccac cagcaaaaac attacagctc 
tagcaataaa atattcttaa attaagtaca 
55 acacttaata gacgacaata aactgtaaac 
agtctctctc tgtcccccag gctggagtgc 
cgcctcctgg gttcatgcca ttctcctgcc 

60 

acctgccacc acgcccggct aatttttttg 
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tgcttgagtt aacacacaag tgataagaaa 26700 

agttttagtt gtctggacag atcaaaccaa 26760 

ccctagcaag gccctaattg gctcttttca 26820 

cttcagaaga agagtttgaa gctagcaagg 26880 

tctccataac ataaaaatac aaggtgaaac 26940 

ttatccagaa gatctagcta aaataattga 27000 

caatgtagat aaaacagcct tatattggaa 27060 

agacaatgct tgggctcaga gcttcaaagg 27120 

cagctgatga ctgtaagttg aagctaatgc 27180 

ttaagaatta atctactctg cttatggtct 27240 

cacatctgtt tacagcatga tttacagaat 27300 

caggaaaaaa agcttccttt taaaatatta 27360 

tgttttcatg tctgttaaca taacatccaa 27420 

actttcaagt cttgttattt aagaaataca 27480 

gattcctctg atagatctcg gcaaagtaaa 27540 

agatgacatt aagagcatta atgacataca 27600 

actttggaaa aagttgatac tacctctcat 27660 

ggaggaaatc actgcagatg ggatggaaat 27720 

aattgctgta atctcctgat aaaactttaa 27780 

aaagaaagtg ctttcttcag gtggattcta 27840 

aatgacaatg aagaattcag aatattacat 27900 

ggagaggatt gactccaatt ttggaagaag 27960 

cattgcatac aacagaggaa acttttggga 28020 

atgttatcta atcttaagaa attgccacag 28080 

atcagtcagc agccaacaac atcaaggcaa 28140 

actgaggctt aggtgatcat tagctttttt 28200 

tttttagaca taaagctatt gcacacttgc 28260 

agtagcataa cttttttttt tttaaggcgg 28320 

agtggcacaa tctcggctca ctgcaagctc 28380 

tcagcctccc aagtagctgg gattacaggc 28440 

tatttttagt agagatgggg tttcgccatg 28500 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



ttagccagga 


tggtcttgat 


ctcctgacct 


tgtgatccac 


ctgcctcggc 


ctcccaaagt 


28560 


gctgggatta 


caggtgtgag 


ccactgcgcc 


cagccagcat 


aacttttata 


tgcattggga 


28620 


aactaaaaaa 


ttcatctgtt 


acgttattaa 


agtattcatt 


ttattgcagt 


ggcct'gaatc 


28680 


tgaacctgca 


atatctccaa 


agtatgcctg 


catatttctt 


tttctgtgtt 


gctgtattga 


28740 


gtttgctaac 


attttgttga 


agatttttgt 


gtccatgttc 


ataagagatc 


atggttttta 


28800 


gttttctttt 


gttgtgatgc 


attagtctga 


ttttgatctc 


agggtaatag 


tggcctcaca 


28860 


gaacgagtta 


ggaagtgttc 


cctcctcttc 


tattcttaga 


agagtttgtg 


aagaaggtaa 


28920 


ttctttaaat 


atttggtaga 


agtcaccact 


gaagtaatct 


gggtctgcac 


ttttcttcat 


28980 


agaagatctg 


attataaatt 


taaatacctt 


aattctttat 


agtctattca 


gattttctat 


29040 


ttcttcctta 


gttttagtag 


tttgcatgtt 


tctaggaact 


tgtccatttc 


agcaggttat 


29100 


ctaatttgtt 


ggcatataat 


tgttcatagt 


attctttcat 


aatcacattt 


atttctataa 


29160 


ggtttgtagt 


aatgtttcca 


cattcatttc 


tgattgtact 


aatttgagac 


ttctctattt 


29220 


ttctttagtc 


tcattaaagg 


cttttccatt 


ttgttcatct 


ttcaaagaat 


caacttttgt 


29280 


gttcattgat 


tttatctact 


gtttttctat 


tttgcttcta 


aataattttc 


actttaatct 


29340 


ttactacttt 


atttccttct 


gctttcttca 


ggtttgatat 


gggttttttt 


tattttcatt 


29400 


atcttaagat 


ggaaagttag 


gttattgatt 


tggggtcttt 


cttctttgtt 


agtataggca 


29460 


tttacagcta 


taaatttcta 


tctaagcact 


aatttagctg 


catcctataa 


gcattgatat 


29520 


gctgtatctt 


cattttcatt 


catcgcaaag 


tattttctaa 


ttttcttttt 


atttcttctt 


29580 


taatcccttg 


gttatttagg 


agtatgttgt 


taatttccac 


atatttgggg 


tttcccaaat 


29640 


ttgtgttatt 


aattttaatt 


ttgttccatt 


gtacttagag 


aaaatacttt 


actatttcta 


29700 


gccttttaaa 


tttattaaaa 


tgtgttatgc 


cctaaactct 


ggtctattcc 


aaagaatgtc 


29760 


ccatgtgcag 


ttcagaaaac 


cgtgtattct 


tctgttgttg 


ggtgaaatgt 


tctatagatg 


29820 


tctgttaggt 


ttagttggtt 


tataatgttg 


ttcaaatctt 


ctatttcctt 


gtttatcttc 


29880 


tgccttgttg 


ctctactcat 


cattaaaagt 


gggtcattga 


agtccctaac 


tattattgtt 


29940 


aaattgtctg 


tttctttctt 


tgtttctgtc 


tggttctgct 


ttgcgtgttt 


caagcctctg 


30000 


ttcttagaaa 


atatatgttt 


ataaccatta 


taaaatgttc 


ctctctatct 


ctagtagcat 


30060 


tttttttttt 


ttgagacgga 


gtctcactct 


gttgcccagg 


ttggagtgca 


gtggccccat 


30120 


ctctgctcac 


tgccagttcc 


gcctcctggg 


ttcacaccat 


tctcctgcca 


cagcctcccg 


30180 


agtagctggg 


accacaggcg 


cccaccacca 


cgcccagcta 


attttttgta 


tttttagtag 


30240 


agatggggtt 


tcaccatgtt 


agccaggatg 


gtctccatct 


cctgacctcg 


tgatctgccc 


30300 
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gcctcggcct 


cccaaagtgc 


tgggattaca 


ttaaaatata 


ttttgtctgg 


tattagtata 


ggattacaaa 


tatttttcta 


tttttttgtt 


tgtgtctccc 


atagacagca 


catagttgga 


tgcctttaat 


tgaattgttt 


tattaattca 


Lwuau <_ M o >-* 


CI l_ l_ L. L. dV. 1*- \_ 1 — 


t~t"r , t"<T*at~crt~ 


gactgcttta 


ttttttggct 


aagtggatat 


ttttttactg 


tatatttcag 


ttatttcatt 


acttatcaaa 


atctgcttca 


catttatact 



atccctgtat agctctattt caatacctct 

20 

tgaatgttac aaacataaga taacattatg 

aaagaagcaa agagaagaaa aggaacaagt 

25 tatctattta ccattacaat ttattattta 

ctcctcattt gttcctgtgg gttcaaattt 

ttccttagct taaaatacat ttgctcctct 

30 

attgcatttc cctatgttat agacccaaca 

tgctttttaa ataagttagt agaagaaaga 

35 atttgtataa ttttcttttg tcagtgatct 

tgtagtcact tgcttttggt ctgaagaatt 

caacaaattc tttcagattt tgttcatctg 

40 

aagacagctt tgctagaagt aggattcttg 

ttgaatttgt tatcccacta ccttctgggt 

45 aatcttattg gagttccctt gtatgtctca 

ttctctttgt tggctttcag cgtttttact 

agtttcttcc ttgggatttg ttgagattcc 

50 

atatggaaaa ttttcaggca ttatttcttt 

cccctttctg gtactttcat tatgtatatg 

55 taagtctcta ttcatatttc ttcattcact 

ctatcaatct atcttgaaat tcactaattc 

gccactctaa cgaatttttt attttagctg 

60 

ggtgttttaa aaatgattta tatatcttta 
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ggcatgagcc accatgcccg gccaacattt 30360 

gccattctag ctttcttgtg tttgttcttt 30420 

ttcattcttt ttgtgtcttt gaatctaaag 30480 

tctggttttt gtatccaatc tgacaatctc 30540 

catttaatct tatattgaaa tagttaggct 30600 

tttatgtttt ttgtttctct attctgcctt 30660 

tttctaatgt agtattttaa tttttaataa 30720 

aattgtcgct ctaaggctca tcatatttta 30780 

aatttaattc cagtgaaata tagaaacatt 30840 

ttatgtgata ttattgatgt attacatcta 30900 

attattactt tatataattc tatctctttt 30960 

atacatttac agtttttgtt atttaccatt 31020 

ccagaaattt tttatttata atttcttgtt 31080 

aacatctaga gttaaactcc tagatgttga 31140 

ccccccactt ttgctgttat tggcaaatat 31200 

atacgttaca tacatattgt ttcatatgat 31260 

ggataaattt gcatttacac tttcttttat 31320 

ttatattttc atgagattca aattactgtc 31380 

ttctttagta ttttttgtaa ggcatgctag 31440 

caaatgtctt tattttgttt tcatttttga 31500 

gttgataggt cttttttttc tttgaaccct 31560 

ttcaatgttt ctgattagaa gtcagctgtt 3X620 

agtcattttt ctcttgctgc tttcgagatt 31680 

gtgatatgtc tacttgtgga tttctttgtg 3174 0 

taggcggtat agattaatgg ctttgaataa 31800 

gaatatattt tctgcttctt tctctttctc 31860 

ttgtttcctt aatggtattt cacatttctc 31920 

tttctctctg tctttcagat tgcataatct 31980 

tttcttctgc cacttcaaat ctactgttgt 32040 

ttataatttt caactccaga atttccattt 32100 

ctgatattct ctatttgatg caatattgtt 32160 



-293- 



WO 02/062946 



PCT/US01/48437 



atcacacctt cctttacttc cttaatcgtg 
tgactatttt gaaacatttg tttattaaat 

5 

cctacagttt atatcatcta tgggtcaaac 
tgcctattat ataattgttt tgctggaaac 
10 tctgggtact catcccccac accaggactt 
tctaggttgt ttcagtgaag tctatttctc 
cttatagggt acaggcttgt gatgcccaca 

15 

ggctctttgt ctctttccct aaccataccc 
tgttctgagg caaaaactgc ttcacagact 
20 gatagctccc aaaatcagtg tttgagattt 
tccccttttc ccatttctct ctgggaaact 
atgtatctgt gaacacctac caagtgcctt 

25 

cttgtaagcc taaacttctc cacatgctac 
ggagatttct attccatagc ctatttctct 
30 aggggtgagg aagtgggtag ggacaacgga 
gaatggggga ggggacagaa gccccagttc 
cttaccccaa agccagaaaa aggtcaatta 

35 

gagtaaaagc tgggtagaag agatcaccca 
ctcagtgtta gtatttgggc aggaccagaa 
40 aagatagtat agcactctga ctgggatctt 
ttccaatcta gagtggtctt gctgatctga 
cagataccat accagaaact cttacagcat 

45 

cctttatttg ctatattcca ttaaaaaggt 
gatttttgaa ataattttta caaatttggc 
50 atgctggaag tgggactctc caaaaataat 
tattagaatt tttaggaaca tagatacctt 
atcctcatta atggacatag gataagtgtg 

55 

ctttgtacta aggatgaatc tttttagttg 
acaaagctaa aattgaaaat cctaatcaca 
60 catgtattaa atgtattatt tatacatgac 



gcttacttaa gtgctcaaac atctttataa 3222 0 

attatatctg gtctctcaca gtttttgttg 32280 

ttgcctgttt cattgtctgt tttatattgt 32340 

tgggcatttt cgataatata ttatagcaac 32400 

attgttattt acttgattag aaaaaaactg 32460 

cgtcctcact gttaagcttt gatgttgccc 32520 

gtcaccctgg gatgacattt atcttggaaa 32580 

agctgttaag ctctgctagt gcttgctggt 32640 

aacccaaaca aatttgagct cctttgaggg 32700 

gttctgactc caaaagggct cttcctagct 32760 

cgtgggccta cagtttagcc tgtatctaaa 32820 

ttaccacaac ttccactgtt tttgagagta 32880 

aaatgaagtc acctcctttg ggaacagagt 32940 

cccaacatct ctgagctaag gatctggtgc 33000 

gtctttctct ttaagagttg agggtttggt 33060 

ctctcaactt gcctctcatg atgtgaaacc 33120 

gagcccagta ttatagagtc ttcatcccat 3 3180 

tctcttggct gccttcactt ggtacttacc 33240 

aggctaatta gatggcctac ccttcctgag 33300 

tgagaaaagg gagccctgtg tttttggcta 33360 

gaggagggag ggagggtgca gatcttggtt 33420 

ttactgagta gattttcttg aataaatgtt 33480 

ttacagagta cagaggtttt aaaggattaa 33540 

tggggagctg gttgagttcc tcccattctt 33600 

aatgctgtac acttaattct ataaataatt 33660 

cttaagtgat cctggttcat aagcctctct 33720 

cagaggcatt ctaagcatac cacaatctga 33780 

acaactccag cagcatagtt ctacatattc 33840 

atatactaga ttattcctcc tttagtatgc 33900 

taatacatat atgtattaat acatgacatg 33960 
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tattaaatgt attaaatgta ttaaatttta 
cccccattag attaaaagct ccttgagtag 
5 gtataattaa tgaataaata aatattccct 
gacaaatttg aggtggactc ctgatatcaa 
ttaaaacttt aacttgaatg ttctagccag 

10 

gtttatgctg tccataacca atttttcttt 
gtaatgaaca gcataactat ttgtgtgact 
15 attgcaactt ttgcattaaa tgcaaccaca 
aataataatt agatcaaaat tttaatataa 
aagtgagtaa ggcatagaaa aggccaccaa 

20 

aaagataaac tgtattttct ctatattctt 
tcaaaaatcg tgatgcacta taaacctcaa 
25 tgatagaata cttagaaacc ccaaagacct 
tttagagagg tggctggaca aaagaaaaac 
agaaataacc acttaaatat tgaaattttt 

30 

atgtttaata agaagcattt taaaaggact 
cgaatttaca aaatcataaa attgcaagtt 
35 ttccatttag aatctcaaca gcatttcgga 
tatagataga tgaataattg cataaaaatg 
gaggaggatc agataatttg ctgcactaga 

40 

aaaaatagta cagtatttac acactaaaag 
tgaaaaataa atctaagtat aagtgaacac 
45 tcaatagaga aaagatgaaa ttggaaaaaa 
caagaaaata ataatggaca cctactttgt 
tatgtgtaca taaaaaccat taaaattagc 

50 

taagcaacag agaaaaccaa gaaggaggaa 
ttgcactttt tatacaatac aggctgagac 
55 ccagtgggct gagggagtgg ggtccccacc 
agctgactca gaagacagaa ccacagagga 
ttttctctat tgggttttgg acttgcttag 

60 

tctctattgg aaatcgaaat gactatcctc 
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ttaaatgaaa ttgtgcttgt aattatatct 34020 

aaacctgagg tttagtacat tgtaggcatt 34080 

ctgaaaaaaa aattatccta ctagcttcta 34140 

cctctgtgac aatcaaagaa agtcagattt 34200 

taggacagac tgattgcttc agcaacatct 34260 

gtatgtctat tacaccttac tttagcaatt 34320 

acacttggta taattctatt aagcatggag 34380 

tacaattggt caaacgtaag taccaatttt 3444 0 

atgaaaaatt atggtgaatg atgtatgttg 34500 

atgtgtcaat tatctttggc atagaaatgt 34560 

gctataagca cattgaaaaa taagtttttc 34620 

agagagatta aaattatcgt taagataaaa 34680 

acctgaaaaa caactgcaga agtcaaacag 34740 

ctgcaaaaat taatagattg tctttacact 34800 

aaaaaaatca aaataacagc aaaactataa 34860 

gaatactttg aaaaattatg ttcttgaatg 34920 

tctgcaaatt aacacataat tgtgatgcaa 34980 

agtcatgcaa taaaattact ctaaaatgca 35040 

ccaagaaagt ctagaagtta catagtgatt 35100 

tattaaaact tgttattagg ctattataat 35160 

gaaaatttaa ctaagcacag aatagagagt 35220 

atattatttt gtaggtgata acatgaccag 35280 

acagtattaa atcaataagc tattgatttc 35340 

accacaaata aaaataaatc acagatgaag 35400 

ataaaatata gaactaatat attatttaca 35460 

agatagaaag acattttatg tcacaaatat 35520 

aacaatagcc taaagagtta aaatactgcc 35580 

ccagggtgac aaagaatagg gctgtcccct 35640 

ttatcctcag gcctcaaaaa tctaacaaaa 35700 

gtctgctgcc cacttctttc tttcttattt 35760 

tgccagtccc accattgtat tttagaagta 35820 
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gattactagt 


tttctggctt 


cacaagtcca 


cagatagtga 


ggaatttttg 


ccccaagatg 


35880 


5 


aatcatgcta 


taagtgatga 


tttctagcat 


gaatctagat 


gctataactg 


atttagatga 


35940 


tgagatttga 


ggatttttga 


ttttataata 


tttaggaaaa 


gtttggacat 


agagttgatt 


36000 




ctagagtggg 


ttacaacttt 


cggagatgtt 


gaaattagct 


gaatatattt 


tgcacatggg 


36060 


10 


aaggacatga 


attgagggga 


actagaggca 


gactgttatg 


ggttgaattg 


tgataccgaa 


36120 




acatatatgt 


tgcagtccta 


acccatggca 


cctcagaatg 


tgacattttg 


ggggaatggg 


36180 


15 


attattccag 


gtataataag 


ctaaattaag 


atgaggtcat 


acaggggtag 


gtaggtcatt 


36240 


aatccagtat 


gattggtatt 


cttataagag 


agaaatttgg 


acataggtac 


ataaggaaga 


36300 




tggcactgtg. 


tattagttca 


tttttacact 


gctataaaga 


actgactgag 


acagagtaat 


36360 


20 


ttataaagga 


aagaggttta 


attgactcac 


agttttgcat 


ggctcgggag 


gtctcacaaa 


36420 




acttatagtc 


atgacagaag 


gcaaaggaga 


agcaagtacc 


ttcttcacaa 


ggaggcagga 


36480 


25 


gagagagcaa 


aaggggaaca 


gctacagact 


tttaaaccac 


catatctcat 


gagaactcac 


36540 


tcactaccat 


gaaacagcat 


gggggagcca 


ctcccatgat 


ccaattacct 


cccaccaggt 


36600 




ccctccctca 


acacgtgggg 


attacaattc 


gagatgagat 


ttgggtgggg 


acacagagcc 


36660 


30 


aaaccatatt 


attccatccc 


tggcccctcc 


caaacttcat 


ctcctgctca 


catttcaaaa 


36720 




cacaatcatg 


ccctcccagc 


agtcctccaa 


agtcttaact 


cattccagca 


ttaattcaaa 


36780 


35 


agtctaagtt 


caaagtctca 


cctgagacaa 


ggaaatttcc 


ttccactgag 


cctgtaaaat 


36840 


caaaaacaag 


ttagttactt 


ccaaggtaca 


atgggggtac 


aagcattagg 


taaatgttcc 


36900 




cattccaaat 


gggacaaatt 


ggccaaaaca 


aaggggccac 


aggtcccatg 


caagtctgaa 


36960 


40 


acccagtagg 


gctgttatta 


aattctaaag 


cttcaaaatg 


atctcctttg 


gctccatgtc 


37020 




tcacatctgg 


ggtacacaga 


tgcaagaggt 


gggctcccaa 


ggccttgggc 


agctccaccc 


37080 


45 


ctgtggctct 


gcagggtaaa 


gcctctgcag 


ctgttttcat 


aggctgatgc 


tgagtgcctg 


37140 


cagcttttcc 


aggtgcgttg 


tagaggctgt 


cagtggattt 


accattcttg 


ggtctggaga 


37200 




atggtggccc 


tcttctcacg 


ggtctagtgg 


gcagtgcccc 


agtggggact 


ctgtgggggc 


37260 


50 


ctcaacccca 


catttcccct 


ctgcagtgcc 


ctagtagaag 


ttctccatga 


gggctccaac 


37320 




cctgtcacag 


ccttcttcct 


ggacatccag 


gcattttcat 


acatcctctg 


aaatctaggt 


37380 


55 


ggagcccacc 


aagcctcaac 


tcctgacctc 


ggtgcaccca 


caggcccaac 


accatgtgaa 


37440 


agccaccaag 


gcttggggct 


tgctccctct 


gaagcaacag 


cccaagctgt 


accttggccc 


37500 




cttttagcca 


cagctggagc 


tggtgtggct 


gggaagcagg 


gccccaagtc 


ccgagactgc 


37560 


60 


acagagcagc 


agcaggggcc 


ctggacctgg 


gccacaaaac 


cattttttcc 


tcctaggcct 


37620 
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ccaggcctgt gatgggaggg gctacagtga 
cccattgtct tggctattaa tatttggttc 
5 cttgaattta tccccagaaa atgagttttt 
tccaaatttc tatgctttgc ttccctttta 
tgctcacaca tatgtgaaca cacctctaga 

10 

tgcttagaaa tttcttccac cagataccct 
aaatctctag ggcagggcaa aatgccacca 
15 ctttgctcca gttcccaaaa agttcctcat 
cattgtccat atcactatca gcatttcagt 
cctaactttc ccacatcttc ctgtcttctt 

20 

ctatttccca gttccaaagt tgcttccaca 
ctcctggtac caatttcctg tgttagtcca 
25 actgggtaat ttataaagta aagagggtta 
gcctcaggaa atgtacaatc atggcggaag 
ggaagcagga gagagagagt aaagggggaa 

30 

atgagaactc actcactatc atgagaacag 
acctcccacc agatcccccc tcaacacatg 
35 gggacacaga gatgaaccat atcaccatct 
attacaagtc aaaaacacct ggggctacca 
cctatatgcc ttagaaagag catgatcctg 

40 

cagacccata aaacaataaa cttctgttgt 
atattgccct aggaaatgaa tacatgagga 
45 atacttagtt gttatgaaaa tacccaaaaa 
aatatttact cctgattttc taaatgttga 
tgaagatttt tttattcaga tggttggtat 

50 

catcttagag atgaccattg taatcataat 
tgaaaaacac actgaagaaa gtgagtaact 
55 cagtttataa taagggccct ctctgagagt 
tgcttcagaa gtaaggtcaa gtgtatctga 
tgtgtggaat tgaagtcaaa caacttgccc 

60 

tacatctccc aggagattaa cagaggaaat 
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agatgtttga 


catgccctgg 


agacattttc 


37680 


cttgttactt atgcaaattt 


ctgcagctgg 


37740 


cttttatacc 


atgtatcagg 


ctgcaaattt 


37800 


aacacaactt 


ccagtttcag 


ataatctctt 


37860 


aaaaaacagg 


tcacctctta 


aacactttgc 


37920 


aaatcatctc 


tctcaagttc 


aaagttccac 


37980 


gtctctctgc 


taaagcatag 


caagactgac 


38040 


ctccatctga 


aaccacctca 


gcctggactt 


38100 


caaaagcatt 


caacgagtct 


ctaggaagtt 


38160 


ttaagccctt 


caaactcttc 


caacttctgc 


38220 


ttttcaggct 


atctttatac 


cagtacccca 


38280 


ttttcacact 


gctataaaga 


gctgcctgag 


38340 


attggctcac 


agtttcacat 


ggctggggaa 


38400 


gcaaaggaga aacaagtacc 


ttcttcaaaa 


38460 


cttccacaga 


cttttcaacc 


atcaagtctc 


38520 


catgggtgaa 


actgccccca 


tgatctaatc 


38580 


gggattacag 


tttgaaatga 


gatttaagtg 


38640 


gaacatagag gcagagatag 


gagttatact 


38700 


gaacctggat 


gagacaaggg 


aggattctct 


38760 


ccaatacatt 


tattttgaac 


ttttggcatc 


38820 


gttaagccac 


ccagtttgtg 


gtactttatt 


38880 


caatatgtaa gtactcaata 


aatttaaaac 


38940 


taatcataat 


tgtgtgagac 


tgctaggatg 


39000 


tcttatataa 


ttttaaatgg 


ttaattaata 


39060 


gatgctttta 


caatatttac 


tactctgcac 


39120 


catccagtgg 


gttgtaagag 


catttcatca 


39180 


ttgtttctgg 


cagtcaaatc 


aagataaagg 


39240 


tcaaagttta gacttggtga 


acaggagtgt 


39300 


aaacatgctg 


aaatatcttc 


tgcttgatat 


39360 


tggcaaaact 


atgtagaagt 


cttttagact 


39420 


atatagacta 


aaagggtttc 


agagatcata 


39480 
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cttgtgatat ggggatactg actgctataa 
ttggtttcca acatttggaa tatcaaatgc 

5 

gaaaagagta cttctagaat agaacagtaa 
gcaatgagaa tactgaaaaa tttgtcaaaa 
10 aaagacctat gaaaatccaa taagcattta 
acagtgagct ctatggtatt gcagcgtgcc 
agtagccttt aaaagtaaca gccttgcaac 

15 

aacaacagga aaggataaaa taggatttaa 
ttgttgttaa cttgtctggc agctccccag 
20 atttcattca gaactcagtc actgtaaaca 
atcaatagca tttaacaaca cagctgcctg 
ggtcacattt taaaaaaaaa tctgagcagt 

25 

acaatatctg gaaacataaa aggtcacatg 
agacctgaaa aatgtcctaa cctcccacct 
30 cattcaagtt aaatggccga atcatggtaa 
tcctattccc atccctcaca acccatctcc 
ctaaggcaga gttgcaaact ctcccagagc 

35 

caaacacgca cacacacaat ctctcagcaa 
taaggaaatc tttgtccaat cactagctga 
40 acaatgataa aaccaacccc tagagggcta 
ttaaatgttc aatttttaac aagtaaaaat 
gcttaaatac aggagaaaaa aacagtcaat 

45 

gaggttacta gaaaaaaaag gtattttaac 
ttaaaaacca tgtctaaaaa aactaaaagt 
50 attaatagtg acataaaata ggaactattt 
cagaaaatta taataactaa aatgaaaatt 
ctggcagaag aaataatcca caaacttgac 

55 

aaacagaatg aagaaaaaag aacagagctt 
caaaatgtgc ataatggtga acacataagg 
60 acagtgagtt tgtgaagaaa tattggaaaa 
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acccaaaaat 


tcaaatttag ttctgaaatg 


39540 


cttcagaaaa 


taaacatttt agaggacact 


39600 


gaacctccaa 


aaacccatct ctcaataaaa 


39660 


tcaacttttt 


cagaactttg gaaattagtc 


39720 


ttcaagttaa 


atggccaaat catggtaaga 


39780 


ctattcccat 


ccgtcttacc ctatctcctc 


39840 


catagtacct gtgggaaaaa acggcctaac 


39900 


gtgccccaaa 


aaagccccat tctcagagaa 


39960 


acatccccac 


tggcaagact ctttaatttg 


40020 


gcttttcttt 


ctgggatatt tgtcagaaca 


40080 


aggcagagat 


aacagttgtg ccaaacaaaa 


40140 


aagatgtcca 


tagagtattt tgaaattatg 


40200 


catatgctgg gatgtgtaca tgccaagaaa 


40260 


ctggctgacc 


tgaagaaaat ccaattcaac 


40320 


gaacagtgag 


ctttgtggta ttgtaacttg 


40380 


tgcttttatg 


ataaaagcag gaagtgatag 


40440 


tttaaaacct 


taccccaaca caaacacatg 


40500 


agtctaggaa 


acttactggt tcaaaagaat 


40560 


ccactaagtt 


aacaaagcag aataacaaag 


40620 


tgatttacaa 


tgttgccaca ctatattatt 


40680 


ataaaatatg 


caaggaaaca gagaagtatg 


40740 


aaaaattttc 


cccaaagaag tctaaatgtt 


40800 


tcagcaagta 


taaataagta taaagaactt 


40860 


atgaacgcaa 


tgtctcacca aaaagagaat 


40920 


ttccaaaaga 


accaaataga aattgtagag 


40980 


ttactcaagg 


gtctcaacag aaaatttgag 


41040 


agtaggtcaa 


ttggaattac ctaatctatg 


41100 


cagacacctt 


tgaaacatca gcaagcatac 


41160 


aaagaagaga gaaagaaata aaaaaatggc 


41220 


tgtatagctg 


taaagactat attttgaata 


41280 
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agtatctgaa atcaaaaacc caactttccc 
ctaaagtgaa aataataaag actagagtag 
5 tagagaaaaa caataaaacc agaagttcct 
attaaccaca ctgaccaaga aaaaaagaaa 
agaacattac tgacaatgtt atagaagtaa 

10 

agataaccta cataaaatgg acaaatttct 
agaagaaata gaaatctaag tacacctatc 
15 aactttccat aaagaatagc ccagtctcag 
taaagcatta atactagtta ttcacaaact 
cttcctaact cattctataa ggctaacact 

20 

ataataaaag gaaactatag actaatatcc 
aactattaac aaatggaatc tagcaacata 
25 gatttatcct aagagtacag agctggttca 
ttactaatag aataaaagcc agaaaccaca 
ttaagaaaat tgaacaccct tttatgatta 

30 

attttctcaa cctaaatatg ggcgtttatt 
aagagacact gaatcatttt cttcttaaga 
35 caacattgta ttgtaggttc tatccaggaa 
ttaatcaagg aaaaattaaa actaacactc 
aaatgctaaa aaaatccata aaaatctagt 

40 

agaatacatg gccaatatac agaaaccaac 
gtaacatcaa aaggaataaa tatacatata 
45 aagtgcgaga cttgtctacc acaaactcta 
caaacaaatg aaaagacatc ccatgttcat 
atgatggtaa tactccccaa ccaatttaac 

50 

ttcagaaatt gacaagctta tcctaacatt 
caaaccaata ttaaaaaaga aaaacaaagt 
55 gggttaaaga aagacacaca cacacacaca 
taaaaagggc acctcagcag caaatgcagg 
ggggaaccag tgtaatgcca gagcccaccg 

60 

tgggtgggag gggtctaggc agtacggctt 
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ccttaaaaaa tcaggaaaag aaaactaaac 41340 

aactaaatga aatagaagat aggaaaacaa 41400 

tctttaaaaa gaatcccaaa ataacaaacc 41460 

gaagacttaa tttactataa ttataaagaa 41520 

aaatattgaa caatcatatg gcaacaaatt 41580 

agaaatacag acatgactga aactgactca 41640 

acaagttaaa gaatcaaatt aataatcaaa 41700 

atagctacat tgtgaactct accaaaggtt 41760 

ttttcccaaa aaaaagagga ggaataaata 41820 

agtttggtac taaaataaaa caaagacatt 41880 

attatgaata tagatgcata aatcctcaaa 41940 

caaaaagtat tatatatcat gaccagttag 42000 

atatatgaaa actaatcaat gtattccacc 42060 

tgatcacctc aaaagacata gaagaagcat 42120 

aaaaagttga aaaactagga atagaatgga 42180 

aaaaatttat agttaacata atacttgata 42240 

taagaaacaa gatatcttgt cacttacatt 42300 

aatcaggcaa taaaaaataa agaatttagt 42360 

tttacaaatg acattatctg atctcgtata 42420 

cgaactaata aataagttta acaaaatggc 42480 

tgtatttttg tatacaaaaa cacagttaca 42540 

cttcacattc aggaataaat ttttaaaaag 42600 

aaacattatt gaaagaaatt taagaagata 42660 

tgataaaaga caatatttga agacaatatt 42720 

acaacccctg ttaaattaca gctggcattt 42780 

catatgaaca tggaaaagac ccaaaatagt 42840 

aggaggtgtt aggttttgaa gggaaggtaa 42900 

cacacacaca cacacacaca cacacacata 42960 

ctttacatcc agcataaaac ctacagaagt 43020 

ctgcttatag cctgggggta cttataggta 43080 

gctacccagc aggatattga taagatgttc 43140 
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ccatgatgag gtggttctgg cccttggttt 
ctttgcccag caagatataa tagggatgtt 

5 

tcatgtggtt aggcagaatg tttctcacgg 
ccaaggtctg caaaatagca gagaatttac 
10 ctcgcacttc ctagtttcaa aagttactaa 
agcaggatga gccgtggaca aaactccaca 
atcagctgga agcatcggca gactagcgtc 

15 

tctgtccctt tttttttttt agcgctcaca 
attgattgag cagtctaggg tggtaccgga 
20 agagtgaaac agaacagaat ggaaaatttg 
ctatggataa catcagttgc taagtcatgg 
gcaggcccag gcctggtttt gggtctggtt 

25 

gcctttggtt tcgcttcctt gtttcttctt 
aacaatatgg gggggtctct ttctttcttc 
30 aaaaatcagt agaataggat caaaaaccca 
ttgactagca tgacaagaca attcaatgga 
aggacaactg gatatctgca tacaaaagaa 

35 

cacaaattaa ctaaaaatgc atagacctag 
gaagaaaata caggcataat tatttctggc 
40 caccaaaagc atatttaaca aaagaaaaat 
acagaccaac atgtctgatg actacggata 
caaatctagc aacatataaa tatgattata 

45 

attcaaagtt agtttaacat accaaaatca 
aagggcaaaa acatgctcac taaacaaagt 
50 gattttatct ccttacttat ctatcttccc 
aaaaacaaaa atctgtagtt tggtgtcaac 
cttgtttgtg gaattcctgt aggataatat 

55 

agaagtacct tgagaccaga tcttaatggg 
atttagtcta tagataccag ggtgctcttg 
60 aaacttttca tttatataat tattttcatt 
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ggcagaatgt 


cattatggtg 


ttccttggac 


43200 


tctttagttg 


ggcctttgtc 


tgccttgtgg 


43260 


cccgaactcc 


tgtgaaatgt 


ttcactttga 


43320 


aaaatggtgc 


agtttggact 


aacaggaagg 


43380 


aaaattacag 


taatcaagaa 


attgtggtgt 


43440 


gacactgaga 


aagtgaagga 


agtagcttta 


43500 


ttaaaatcca 


agcttgttga 


gtgcacaatt 


43560 


atactaaagg 


ttt'cgcatga 


aagggttgtg 


43620 


caggggctgc 


agctacatgc 


accagtaatc 


43680 


ttgtatgtcc 


ttccacacaa 


tgtctggaat 


43740 


gttgaatttt 


aaccatcagg 


ctaaggtcag 


43800 


ttgggtctgg 


tgcctggcgc 


caggctgcct 


43860 


aaaacaggta 


ctgagtataa 


aacagtatag 


43920 


tctcagtgat 


actgccataa 


tgatagacat 


43980 


aaaataagtt 


catacgtagc 


tacttgattt 


44040 


gaaatagtgg 


tctcttcaac 


aaatggtgcc 


44100 


tgaagttaga 


ccactacctc 


acagaatata 


44X60 


atttaagagc 


taacattaca 


aaactcttag 


44220 


ctgaattagt 


taatggtata 


ttagctatca 


44280 


catagataaa 


ttgaccaagg 


gaagaaaact 


44340 


caaaaatctt 


caaaaaaaaa 


actagcaaac 


44400 


caacagggtc 


aagtgagatt 


tattacagga 


44460 


atcaatgcaa 


tatatttcat 


taaaataata 


44520 


gatgaaagga 


aatagattgt 


aatggctgat 


44580 


taaatccttt 


caaaatagat 


aattaattcc 


44640 


ctaaaactct 


gtcaggagcc 


aaaggaatgg 


44700 


gtgggggatc 


atggcaggaa 


atgaggctgt 


44760 


gcttaactat 


accttattaa 


agaattttga 


44820 


cagggttttt 


aacaacagtg 


tttcttgatc 


44880 


agaatatttc 


ctgcagcaag 


ttacagccct 


44940 
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tcccccctac cctcaactca aactgctaaa 
cggcttaggc agcataattt ccggttcaca 
5 gctacaagga gacctctctt tacttcactt 
gagttgagta atagcaaagg gaataggatt 
gcaggataat tttgagggag tcaaccagtg 

10 

tagagatagg aaaaagaaga acaaagctag 
aggcaagcca aacaatgagt acctaaacct 
15 atttgaaaga tatttaggag gcttattaat 
aaatcttaga ttaaataaaa ttagtttaat 
tactgctcaa taactattga acatagcttt 

20 

aaactatttt ttaaatcatg tattttcagt 
ttctgctgtg aacttaaaaa caaaatagaa 
25 actgttccca ttagaatggc ttctacagtg 
tggctctccg ctccaatggt tgttctcttc 
aggaatcagg actcaccaga gtccagttaa 

30 

tatgtttgtt gtagttcatt gaaacctctg 
tttggctagg ccccccagaa ctgagtaggt 
35 acattccagg tttacatatt tgttcatttt 
tgtctcagga taaggctttt attttagact 
atctcaggca ctttgccctc ggaaatctca 

40 

acaacttcta tttcttcata atttgcaaag 
attaattgtg aattagcgaa gaactctaat 
45 ctaatgaaat cctttccatt gctctttttt 
tcacaaattg actcttccct ttttaaaagc 
cctgcaatcc ctgtggttga agggatgtag 

50 

aggttggagg agattggctt gaattaagga 
gattttggaa gttggcataa tgttcatgtg 
55 ttttgttttc tcatctgtaa aacagaaata 
gattaaatga caatatatca ttaaaagagc 
tcaataagtt tttattattt attatttatc 

60 

atatatgaag ccagattgaa tttttcttgt 
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tcaatcagga atcatctcat gatcataaga 45000 

tccagcagga gaaactggcc tttctcagaa 45060 

ggcttaattg cctcagatct gctaatctct 45120 

accatgaagg attcagacaa atcatttgta 45180 

tcctcagtca ttctgggagg aaaatagaga 4524 0 

agatgaatat aattagctgg ttgccccaac 45300 

aaacaatgaa gaataaagag aagatgataa 45360 

gtcttgtatg ttcgtttcac actcaatgtt 45420 

aaactgctta tcccttattt ctatttccca 45480 

aaatactaac ctgtatttta cttgattagg 45540 

tttgtgtgta tgcgcttata cgatataata 45600 

aaacagaggt ttactcaaga aaagaatgac 45660 

gctcccaaaa atcctgtagt atgagccaga 45720 

tgatctagaa tatgaagtag agtacccaag 45780 

aatcatttta agaagaataa tttagtttaa 45840 

gtatctgatg agttataggt ctgatatgcc 45900 

actgtcttct cagtacactt ttaactaacc 45960 

ttgttgttgt tttgtcttac tttaactcaa 46020 

gtaataatga tcaatttgct atctcaattc 46080 

ctgtcggaat ccttggaaat cacaccacgc 46140 

aaccagtgtg actccagtca agaacaaaag 46200 

tttttcaagc agccacatgg aaaacgcatt 46260 

cttagttctt caataaagca tgcgatttga 46320 

cagtatgggt taacacggat aaatatagcc 4 63 80 

gagagtagtg ttgaatgccc actttatctt 46440 

actccttcat ccaagaggag aaaatactct 46500 

ataatgtcaa tactcttgct ctctctgtac 46560 

atagtctcta acccacagag tagatgggag 46620 

ttacaacagc tcctaactca tagttgtccc 46680 

agctcttatc attgtatcag tagcataatg 46740 

tttccaagaa ttggtgacta gaacctccct 46800 
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ctgaaagcag atgtactgaa atgaggccac 
atgtattgag ttgtgtcccc ctcaaaaaaa 

5 

aatatgacct aatttagaag tagggtcttt 
taggagagag tctaatccaa tatcactggt 
1,0 cgcagtggct cacgcctgta atcccaacac 
aggtcaggag ttcaagacta gcctgaccaa 
caaaaattag ccgggcatag tggtgcgcac 

15 

caggagaatc acttgaaccc aggaggcaga 
actccagcct gggcgacaga gtgagactct 
20 acacagacac agatatgcac agagggaaaa 
gtaaagatga aggattgggt gatgcatcga 
aatcaggagg acgtcaggaa ggatccttct 

25 

agagcacctt gattttggac ttccaacctc 
taacaccttg ttttaaccac tcgttttaac 
30 cggggaaact aatgcaggtg gcttttcact 
ggaacatgtt aggtgtttat gaaagcaggt 
agtgcagcat tcgaacggct ctactctctg 

35 

gtgtgtggct gtttacagga agcatcccac 
ggacttgctg acctgatcta gaaaagccag 
40 atggtatagg agcctgcaga gagggaccat 
aggtaaaaaa gaaagatacc tatcaccact 
gggaaatgag actggaagcc tcaggttacc 

45 

atatgaaagc acttcagcaa aatgctgcca 
ttaccaaaga agcactaatg cctaggagag 
50 tggccagtag aagcttccac atacctgcct 
ctatattctg acatgggagc tcatttaatt 
ttatatgtat aaaatttgtc attttatagg 

55 

atcataattg tataatttat atgcatattt 
aaagcctctg ctgtgaacat tacatattga 
60 cagtcttatt ctgatgttat ctccaggagt 



aaaactactt aaaatcttga tgaaaatgtt 46860 

tatattgagg tcctaatccc cagtactcag 46920 

aaataaccaa tcgaattaaa atgaagtcat 46980 

ttccttataa aaagagaaaa ttgggccagg 47040 

tttgggaggc cgaggcaggt ggattgcttg 47100 

catggtgaaa tcctgtctct actaaaaata 47160 

ctgtaatccc acctactcgg gaggctgagg 47220 

gattgtgatg agactagatc gcaccattgc 47280 

catctcaaaa caaagaaaag gggaaattag 47340 

cgatgtgaag acacacaaag agaaaaccat 47400 

caagccaaga agtgccaaag atttctagca 47460 

cctccaggtt tctgagggag catgaccctg 47520 

cagaacagtg agataataaa tttctcattt 47580 

cactcgtttt aacaccttgt tacattagct 47640 

ataaggggca ctagatttcc cccaccccat 47700 

tatcctctca cctcagcatt ccacagtagg 47760 

aagctcgggg agagaagaag cagctgtaaa 4 7820 

tccccaacga tagtctttgg gacccaggtg 47880 

agcaaaagcc agcaatgtta aaaagaaggc 47940 

ggaccactcc taaaagtcgg agcttgcaag 48000 

gcctgcaggc cccaaccctg gaacactaga 48060 

tgagaattgc tgtagcatgg ggagcaaggc 48120 

atggagggtc agtacgaagg tgtagccaaa 48180 

agcttctcag aattgagtgg ctcttaactg 4 8240 

aggttaggaa tataatccac agaaagagaa 4 8300 

tcatttaatt tttttaaaat aaaggttttt 48360 

catattttaa atgtggatat tttaatttat 4 8420 

taaatgtgca tattttgtaa cataattttt 48480 

atgtatatgt ttgggttgtc tgttgttttt 48540 

ccaccataca atttctccag agcctctgtt 48600 
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acatctatct tcttccaatc catccttgat 
atgttaagta ctagagactg aacatatatc 
5 aactttgtgt gggatagttt tctaattccc 
ctgtatttgc cctggttaat tatgagacta 
aattggcctc agcctgttct gcctacacag 

10 

tagatgaata agtctgttct tttattatct 
ttcccattaa tacctctgct ctttcttttt 
15 ctttttctct tacatacaaa gtattattat 
tatttcagta tattcaggtc tgacatttta 
tacttgccct ttatattaca gttaagcatt 

20 

ttatttgtag tttgtatctc aggatatgaa 
aaatttttaa accatctaga ttttgactaa 
25 aatgtgaaat aactgatatc agatatactg 
taaaacatca atttttaggc actgtacaaa 
tgggaaacat acaaggtgag cccccacatt 

30 

aggccatgtg gtaggaacgt agaacccaaa 
gagattaagt ttgaaactgc taaagtagct 
35 ggaagctact gaacagatac ctcagaagtc 
gaatcctggc catatccttt aatacaactt 
ccgggtgtgg tggctcacgc ctgtaatcac 

40 

gatcacttga ggtcaggggt ttgagaccag 
ctaaaaacac aaaaattact caggcatggt 
45 tggctgaggc agaagaatca cttgaaacag 
gccactgcag accagcctgg gtgacagagc 
gaaaaaaaaa aggaagtaaa ttccagggag 

50 

acacagtgtg gtgagtggtg aagtgcaggc 
tccctgacat tcaaaggata ctccagagat 
55 aggagtacag gctacacctt ggaataagaa 
cctaaaatca ggatgcaaga tgatctgcaa 
gagaggaaag agaatagtac ccagatcctc 

60 

acttggatac agcagagaac agacgagaaa 
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acacttttaa agataacagc acccaagatc 


48660 


tctacagata gtgggatctc tcatagttca 


48720 


cattccatgt 


aataacaaaa gaaattaata 


48780 


tttctcactt 


tttctacttc agcactctgt 


48840 


attaacatta 


tatcctcctt cctcctatat 


48900 


cattctattc 


tttattattt ataatagtaa 


48960 


tatatatttt 


ttgtacatat gacttctgat 


49020 


aaaaatgtac 


aggcatatct aattattatg 


49080 


tataataaaa 


tggatatttt ataataaatt 


49140 


atattgatct 


cttaaattat gtagatctca 


49200 


aaatattaac 


tctacattta ttttactgag 


49260 


aagtcttagt 


acagactttt aattccatcc 


49320 


aaactagcta 


caaaaactgg ggaaaatgca 


49380 


aaccaacaga aggtttccat tctttggaaa 


49440 


cacccagact 


tccccttggg aacattttcc 


49500 


tagttacagt ggtcttgctg gatggaagca 


49560 


ggaattcgat gggcatacct acagagatag 


49620 


tacaaagaaa 


ttttccttag gtccttagct 


49680 


cagaaggcct 


aacagaaagt aaattcctgg 


49740 


agcactttgt aagggaggcc gaagagggca 


49800 


cctggccaac 


atagtaaaac cccatctcta 


49860 


ggtgggcgcc 


tgtagtccca gctactcagg 


49920 


gagacggagg 


ttgcagtgag ccgagatcct 


49980 


aagaccctgt 


cttaaaaaaa aaaaaaaaaa 


50040 


actgagagct 


gagcatggat tcttgatcct 


50100 


gcagccaaaa 


gagagaaacc ttcttgatca 


50160 


gctacagctt 


ctaattaaga acgatgaact 


50220 


aaagggagaa 


aaagacgcac cctaacaaag 


50280 


aaatgtaatt 


gctggtgtgc ctcctccaag 


50340 


acattttgaa 


cagatcatct aggaaagaac 


50400 


caccagcagt 


aagaaaggac atggtgtgag 


50460 
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gaagcttgcc cagccaggaa ctgaccaaga 
aggaagagag aaacccctag aactccaaaa 

5 

accctgtagt ggattcacga cagcatcagg 
cagagatgtt gccccagaaa gggaacccac 
10 cagcctcagc tgggagccat tttgagagcc 
tgccactgag cagctccaag gagggagagg 
agggtacctg ccgccctgct atgggctgca 

15 

ccccacagct tcttgctcac actgcttgcc 
ccccaaggca ccatcttaag agttgagtgc 
20 aaactaacac aattacagct gctgcctagc 
atgcaaatat aggacaatac ccactgatct 
ggggatgaca ctccccacag cttcttacat 

25 

ccctggccac aggcctaagg tgccattttg 
caggccaaat tcaagttgac atggctgcaa 
30 ccaggctttc cgacgcatac ctaggacaat 
accaacactc aagcgaacca cacttctcac 
gggcctgacc ctctctggtc acaagcccac 

35 

agctgtgccc cacccttggg ccaaggtcaa 
agagagagac aaggaagacc aaggtctcca 
40 gctaaaggca gctgtgggac cagggagtaa 
caccaacaca gatggcatgg gtcactgtgg 
gccagcttcc caggggcctg agactccacc 

45 

acttctaagc aagcagcttg gaggcccaag 
gccagtataa gccactctgg agccttaaaa 
50 tcgggcccaa agactggctc agttggtgta 
tacctgatgg ccaggcacag tatttgggag 
agtttgagac cagcctggac aacatggtga 

55 

gatgggtgtg gtggtgcaca cctgtaattc 
cacttgaacc tggaaggcgg aggctgcagt 
60 tgggtgacag agtgagaccc tgtctaaaaa 
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gccaagagag 


gctcctggac 


ctagggaaac 


50520 


tgggctttta 


tgatcttggc 


tatgggagaa 


50580 


cctggcatat 


ggagcttcct 


aaagattgca 


50640 


acagaatccc 


acaagcttca 


gcacccagag 


50700 


tggataccgg 


gtatctacag 


acacagctgc 


50760 


gaagaacagg 


tgctcccatg 


caccaatgag 


50820 


gttgagactg 


agacataaat 


gagccacgtt 


50880 


tgggagatgc 


cccaccgtct 


ctggtctaaa 


50940 


tggactgtgc 


cctgcccttg 


ggctgaggtc 


51000 


caaggagtga 


cggggaaacc 


aggctatctc 


51060 


gcaacaggct 


gctgtgagac 


tgagacgtga 


51120 


atgcttcttg 


cctaggaggg 


accccaccct 


51180 


agagtttaat 


gctgggctat 


gccccacggt 


51240 


ctactgccca 


gtcaaggagg 


gatagggcag 


51300 


attcaatgcc 


ctgccatagg 


ctgctgcaag 


51360 


agagtcttgc 


ccatactgtt 


cgcctgagaa 


51420 


agctggcact 


atttttagag 


tttaacacta 


51480 


ggtgacacaa 


cagcagtcgc 


cacccaactg 


51540 


aagcacactt 


agaacaatac 


ccactgccct 


51600 


cccaccccaa 


tccattgcag 


cttccagcaa 


51660 


tttcctccac 


cactgctact 


gccacaccac 


51720 


cacacaccag gcccactgct 


cccactgccc 


51780 


aatgagccct 


ctaggaccgc 


taacacaagg 


51840 


caggctcgct 


caccccaatg ctgccactac 


51900 


ttagtttgtt 


ctcacactgc 


tatagagaaa 


51960 


ggtgaggcag 


acagatcact 


tgaggccagg 


52020 


aaccctgtct 


ctactaaaat 


acaaaaatta 


52080 


cagctacttg ggaggctgag gcatgagaat 


52140 


gagctgagat 


cgcaacactg 


cactccagcc 


52200 


aaaaaaagaa 


gaaagaaaga 


gagagaggga 


52260 
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gagagagaaa gaaagagaga aagagagaaa 
gaaagaaaga aagagaaagg aaagaaagaa 
5 aaagaaagaa ggaaagaaaa agaaagaaag 
agaaagaaag aaagaaagaa agaaagaaag 
aggaaagacc tgagactggg taatttaaaa 

10 

cggactgtac aggaagcatg gcagcatatg 
catgacagaa ggcaacatgg gaacaggcat 
15 ggggtgcccc acacttttaa acaaccagat 
agtaccaagg ggaaaatctg ctcctgtgat 
acactgggga ttgcaattca acatgagatt 

20 

gttggcatct aagtccccag ctaaaagtca 
aggcaatccc agagaccact gactttgtgt 
25 tactgccagc atgcaaaatc aaagtcgaag 
agaaaaaaaa aaaaaattcc cctaccaaag 
tacactagat gcaaagacat caatggaaaa 

30 

ggatatcatc aaaggaccat aattgtccag 
aatgccagat aaataattta aaatattgat 
35 tatctgaaaa tcaggccagg catagtgtct 
tttgggaggc catggcaggt agatcacatg 
catactgaaa ccctgtcttt ctactaaaaa 

40 

. aatcagtaaa tcaattcaga aaatgaaaga 
ttttgaagta attcagccag acaaaaataa 
45 cctttaagat atctgagatg atataaaaca 
99999^*9*9 agattcaaaa gtttagaaat 
ccagcaattt gggaggccaa agtgggagga 

50 

taagcaagaa aacaagaatc tgcctttaca 
tggctcatgt gtgtcatccc agctacttga 
55 aggagtttga ggttgcaatg ggccatgatt 
gtgaaacaaa acaaaacaaa agcttagaaa 
tcccaagaat atcaagagag ttagacataa 

60 

aacacattgc aaaaaggatc tcaccatggc 



ggaaggaagg aaggaagaga aagaaaaaaa 523 20 

agaaagaaag aaagaaagaa agaaagaaag 52380 

aaagaaagaa agaaagaaag aaagaaagaa 52440 

aaagaaaaag aaagagaaag aaagaaagaa 52500 

gaaaataggt ttaattgcct ggtggttcca 52560 

cttctgcaga ggcctcaggg agcttttcct 52620 

cttacatggt aggagcagaa ccgaggcggg 52680 

atcacacaaa ctcactcact attgtgacac 52740 

tcaatcgcct gtcaccaggc cccacctcca 52800 

tgggcaggga cacagaccaa aaccacatca 52860 

ccacaacctt aactgtacca taagtcacca 52920 

actgcgaaag aagtcaaaca aagatcacac 52980 

tatcttaatc aacaacacgt acacaacctc 53040 

caatttcaaa aaattggaac aagcaactgc 53100 

acacaggaag catgaaaaag cagggaaatg 53160 

caacagattc caacaaaatg aatcccttga 53220 

ttttaaagaa gcttaatgag atgcaaagga 53280 

cgtgcctgta atcccactgt aatcccacac 53340 

aggccaggag ttcaagacca gcctgggcac 53400 

tacaaaaaaa aatgtaagct aatacaagaa 53460 

aaataaaatc tcagtacctg aagacatgtc 53520 

ggaaaaaaga attacaaagg ataaacaaag 53580 

accaaactta caaattatca gtacacccaa 53640 

ggccaggtgt agcggctcat gcctgtactc 53700 

ctgcttgaga tcaagagttc aagactagcc 53760 

aatttttttt tttaaattag ccaggcatgg 53820 

agggctaagg tgggaggatt gcttgagccc 53880 

gtgccactgc actccaacct gggtgacaga 53 940 

acatatttga ggaaataatc aataaaaatg 54000 

agatacagga agtccagtga tacccaggca 54 060 

aatattatat tcagaatgtc taaagtcaat 54120 
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gagaaagaaa gaattttgtc agtaacaata 
tccattagac taacagcaga cttttcagca 

5 

ggtattttca aagtgcttga acaaaagaac 
ataggtttca taaataaagt cctttccagg 
10 tacaaatggt caccctacag gaaatgctca 
aatatccatc atcattaaaa cccatggaaa 
acacaaagga ggaaaagaaa ggaatcaaat 

15 

aagacaaaaa gacagaaaca aagaatctat 
acaaagcttc acatatcaat attaaccccg 
20 agatgcagat tggcagaatg ggtttaaaaa 
ctcaccttac ccacaaaaac acatataggt 
agcaaacaga aataaaagca agcaggagta 

25 

aaatcaaaaa cagtaagaaa gaacacagaa 
agatgtaaca atcctaaata tatatgcacc 
30 aatattactg cagaaaacca aataccacat 
gaaaactcat caacacaaag ggaacaatag 
ggacgaggaa gaggagcagt aacataacta 

35 

aaataatctg tacaatgaac ccccatgaca 
gtacccccaa acctaaaaaa acacacacaa 
40 gagaaaggca gcaacacaat aatagtgggg 
agatcatagg gagaaaaaat taacaaagaa 
atggatttaa caggtatttg tgcagtggtg 

45 

tgggttcaag tgattctcct gcctcagcct 
accatgcccg gctacttttt tgtgttttta 
50 gctggtcttg aactactgac ctcagatgat 
tttacaggca tgagtcactg cgcctggccc 
accattcccc aagatagact acatgttagg 

55 

aattgaaatc atgtcaacta tcttctcaga 
caagagaaat ttcagaaagt atacaaatac 
60 tcactgggtc aatgaagaaa ttaagacaga 
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caaaagtttc tagtcaccta 


taaagaaaac 


54180 


gaaatcttac aggccagaag 


agaagggaat 


54240 


tatcagccaa gaattttgtg 


tcctgccaaa 


54300 


caagcaaaca ctgagggaat 


ttgtcacccc 


54360 


aagaggtctt aaacatgtga 


atgaaaagtc 


54420 


tataaaactt acagtttgta 


taaaacaatc 


54480 


ggcaacatga caaaatttta 


ctaaaccaca 


54540 


aacttgaaat cgatcaacaa 


tatgacagga 


54600 


aaagtaaatg gactaaatgc 


tccactgaaa 


54660 


ataatccaag tatatgctgc 


ctacaagaaa 


54720 


tgaaagtaaa gaggtggaaa agatattcca 


54780 


gttatcctta tgtcagataa 


aatagacttt 


54840 


gggcattaca tagtgataaa gtgatcaaga 


54900 


caacattgga gcacccaaat 


ttacaaaaca 


54960 


gttctcactt ataagtggga gctaaatgat 


55020 


acactgagat ctatttgagg gtgaagggta 


55080 


ttgggtactg ggcttaatac 


ctgggtgatg 


55140 


taagtttacc tatgtaataa 


accttcatgt 


55200 


atattactac acctaaagaa 


agagaaagac 


55260 


gacttaacca ccccacttgc 


agcactagac 


55320 


acattggcct taaattggac 


tttagaccaa 


55380 


caatcttggc tcactgcaac 


ctccacttcc 


55440 


cccaagtagc tgggactaca ggcatgcacc 


55500 


gtagagacgg ggtttcagca tgttggtcag 


55560 


ccaccctcct cggcctccca aaatgctggg 


55620 


agaatatgca ttatttttat 


cagcacatgg 


55680 


ccacaaaact agccttaaga 


aatttttaaa 


55740 


tgacaatgca ataaaggtaa 


aaatcagtac 


55800 


atggaaacta aacagcatgc tcctgaacaa 


55860 


aatttaaaat ttttatgaaa 


tgaaaagaaa 


55920 
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aacacatctt accaaaacct gtgggataca 
gcattaaatg cctacatcaa aaaaagcaga 
5 tctcaaggat gtagagaaac aaggagaaat 
aagatcaagg caggactaaa taaatcagaa 
tgaaaagttt gttccttgaa aagataaaat 

10 

gagagaagat tcaaataaac acaatcagaa 
cagcaataca aaaggctctg acataaggat 

15 ctcactgtga aaaaaaatgg aaaagtgata 
gtattaatga atgagttctg ttatagggaa 
agagacagtt gaaacacact tcaaaattat 
ttatccagca tatgcacgtt acactggata 
cacctatggc ctgccccata cctatgcatc 

25 tggagactgg ctgaagatgc agcatcaata 
tgaggctgca gtgagctatg atcacgttac 
aagacctcca gagtggatca aggggatata 

30 

ggtttggaca ataataatct cccttgtatg 
cttctataaa ctcaattaaa aggtgacaaa 
35 ggtcctgaat gtcccacgtg cactagggct 
tcttattatt tctgtcaata cagtttggtt 
agttacttta tagagttaga caaatgctaa 

40 

tagccataaa atataatgaa aaaggaaaaa 
gaagtacact ggaaagcctc tataattaaa 
45 atagaccaat agaaagtaaa agaaagacca 
ggctgtgaca aaggtggcat ttcaaatgcc 
tgccctgggg cagccaggta gacttttgga 

50 

cattcacaaa aaaatacatt ttacatgtat 
acaagtatta gaagaaaaca cagctaaatt 
55 tcaaacaatg actcaaatgc ataggtaata 
aatttttaaa atttttcttg acagaaaacc 
aattggagaa tataccaaag acaaagacct 

60 

aaattaaggg acaaaggtcc aggaatctga 
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gtgaaagtag ttctatgagg gaaatttata 55980 

aagatgacaa attaacagct taatattgta 56040 

caaacccgaa attggcagaa gaaaaaaaaa 56100 

atgaaaaaag aataaaaagg atcaacgaaa 56160 

tgataaacca ctagctagac taataaaagg 56220 

ttgaaaatga agacattaca attgatacca 56280 

ttgggtgctt acttgggaag tgacccaaga 56340 

ccaaaaaaag gaagaaagtc aatagactat 56400 

ctggtgctta atcctgttgg agactctcta 56460 

ttctcttagg ggacagagaa gctagggtag 56520 

ggaaaaattc ttgtgggatt aacttcccag 56580 

ctcggaaaag aaaggagttg tccatagatg 56640 

cagcatttgc tgcagagttt gctcagagtt 56700 

tgtactctag cttgagcaag accctgtctc 56760 

gatgctgcat caacagcatt tgctgaagta 56820 

tctccggtcc cttctagaaa gttggcacca 56880 

tttcatcttc tacaatatct ggttaaacct 56940 

tagccaccta tatgtcctca ttcagatgcc 57000 

catgttattc gtctttacaa aaagctattg 57060 

agttcatttg gaaaaacaaa catgcaagaa 57120 

ttacacagga gactaaccct acaggacatt 57180 

acattgttga aataacacat gaatagacaa 5724 0 

gaattagaac caaatgcata tagaaattta 57300 

gctgaagaga agatgagcta tttaataaat 57360 

aaaatttaaa attagaccca catctcatag 57420 

taagaatcaa aatgtaaaaa gtaaaacaat 57480 

cctgtttaat cttagtgtta ggaaaggctt 57540 

aaagataaga ctgataaatt ttactacaaa 57600 

accctaaaca aaatcaaaag acactgtcaa 57660 

aaaatcccta gtatgtaaag aactctttaa 57720 

tagaaaaaat gggtaaagaa gtaaaatttg 57780 
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ggttaggcat ggtggctcac tcctgtaatc 
tcatttgagg ccaggaattg aagaccagcc 

5 

aaaaatttta agatttagct ggccacagta 
ggctgaggca ggtggattgc ttgagcctgg 
10 actccagcct gagcaagacc" ctgtctctaa 
ctgactccag ctaccttttc atctttttta 
caacatatct ctaccttctc ttccccaaga 

15 

ctttttgttt tgaccatctt acaatgttct 
ttgttcacct tcttcaacgg taaatctgtc 
20 tgtgcaattc acacgatgta tgtcatagat 
gattttctat tggctctgac ttggatgaaa 
cactcgctgt gattctatcc ctttctggac 

25 

tgtttgaagt cttccctaat ctcttcaagt 
gtggggacat ttatctctct tctgtggtgt 
30 atacagcaca ccctgggtgc ccagttcatt 
ctttcttctc catggagatt ataccataga 
ctcattaacc aaaacaggaa ataaatacga 

35 

tggcctgcat ccaggaaata gaggacttcg 
ccagtacatt cattctctca ggaaaaaaaa 
40 gatcaagaga tacgtggctg ctggcagagc 
aactgggagt tggtgataag gtttttctac 
tgagggatag acagagtgta ttgtactagc 

45 

agaaagatag gacatggagg gggatgggag 
cattagaatg tataagtagc ctagaaaatt 
50 ctgccttggc aatggggctt cccttgagaa 
acctgggacc ttggaggatt cttgtccact 
agagaattga actcaatgtt gtccaagaga 

55 

agtactaaca tatgcaaagc agcactctaa 
catctgtaat cccagcactt tgggaggccg 
60 cgagaccagc ctggccaaca gggggaaacc 



cctgcacttt gggaggccga ggcaggagaa 5784 0 

tgggcagcat agtgaaatcc tgtctctaca 57900 

gtgcgtgcct gcagtcccag ctattcagga 57960 

gagtttgagg ctgcagtgaa ctatcactgc 58020 

aaaatacaaa caacaacaaa aaagtgaagt 58080 

cattcaatgt ccttcactgt gagccattcc 58140 

cttttccctt tccataccct caacggtgtt 58200 

agccatccca tcaacttcat ttgttcttca 58260 

atggactatt acatgcagct gtttttgtac 58320 

atgtggaatg ccctcccatg gatggcttct 58380 

atagatctaa gttgattaca agtctcattt 58440 

tgcagtttat tgatttcata tgccaccttc 58500 

ccaacaaatc attttctcct ctgttctcct 58560 

ttatttcatc ctgtctcatg ttacagctaa 58620 

tctgtgtctc ccagagcaac cagcccaacc 58680 

ctcttgtcaa atcaaaccat atttaactgg 58740 

atgaaactga gctctaagca gcatgtaacc 58800 

gatccttcta accctaccac ccaactggcc 58860 

caaggtcccc acagcaaaga aaaggaatag 58920 

aaggtgagtc tgcttaatct ggtggttggt 58980 

caagtgagac ttacggcgaa ccagtgtgga 59040 

tttattgagt ctgctccaag agaaagaaag 59100 

aaacaaaaag aaaaagaatt tttaaaacca 59160 

gagcagtgga agccacccaa tgaacagaca 59220 

ggataatctc catttaacta atattgacat 59280 

ctctcaccct gtttcctttt ccagaaaatg 59340 

ttttctagtt ctgaactttt gctttcagcc 59400 

acactatgag aggccgggtg cggtggttca 59460 

aggcaagcgg atcgcctgag gtcaggagtt 59520 

ccatctctac taaaaataca aaaattagct 59580 
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gggtgtggtg gcgggggcct gtaatcccag 
ttgaacctgg gaggcggagg ttgcagtgag 
5 gtgacagagc cagactctgt ctcaaaaaaa 
aagatggaaa agatggaccc cttgccctct 
ttctcatctg taaaactggg cgggtgagag 

10 

cttaagtgcc ctataactga tatctattat 
ggttaagatg ctccaatgag ctagaagaag 
15 tgtgagggaa aattctctcg catttatttc 
ttccggtggc caattctgtg ttggtggtgg 
gaattatgtc tcacgtgccc ctgtatccaa 

20 

ggaacccacc tagtttggtg tcgaatgtga 
aaggcaaaac cttgggggta agtgagattt 
25 agtgatggca gggggaaggg aaggcctggg 
ggctgagttg atcgcgcctt gccaagcaca 
cacaggcggc ctctgctcag accttgtatt 

30 

gaaaagcaac aaaaaagcag cacgcttgaa 
gccctgtggc ctctgaagaa aggggtccca 
35 cagaatccaa taagatcgaa tcaaatattt 
ccaacaacat aaatggtcac cgcaacccct 
gcccatttca ctgtgtagct gaatgatggg 

40 

caaccctagc tccaccctca gatagtgctc 
gagggaaatt tgtatctgcc aaatttacaa 
45 atcctccaga tatgcctgtg cccacaaaaa 
cgtggtttgt gataacaaaa atattgtaat 
atattggtta agtcagtgta atatactggt 

50 

gtggagctat gaaaagaaca agaaagctct 
atggagttaa tggaagaaag taaggggcag 
55 gaatttaagg aagtaatata tttgtatttg 
agatgttctt tcatttacaa tgggattatg 
tcataagtaa aaaatacatt taatacatgt 

60 

ctaccttaaa caagttcaga acacttaggt 
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atactcagga ggctgaggca ggagaattgc 59640 

ccaagatcgt accactgcac tccagcctgg 59700 

aattaaaaaa taaacactat gagagacaca 59760 

ggtagccttt agcctctttg agtcttagtt 59820 

gatgaaagga aatgtatttg taaagcacct 59880 

ttgtaacatg ggattcataa acttgtagag 59940 

caaaagtcct ttttaaagga caagtccata 60000 

ttggcagcat gaattcgatg acttcagcag 60060 

caccccacaa tggttatcct gtgaccccag 60120 

acagccagcc gcaagtccac ctagttcctg 60180 

atgggcagcc tgtgcagaaa gctctgaaag 60240 

ccctttgcag gccacagact gcacagctgg 60300 

aaatacacaa acactacatc ccgcaagtgg 60360 

ggtctattag aaagctcagc tgcacaacat 60420 

ttctgattta ggcattttcg gaaagagaaa 60480 

ggttgatcct gttgccagga tccacgccca 60540 

gcaagaccaa aatagaccac aaggtggagt 60600 

gcccaacagg aaacaaagca ggttaggcaa 60660 

gagaggctgt cctgcgctgg acaaagaaag 60720 

catgaatttg gatgcagact gcccgagctc 60780 

atgggaacaa aaattgatac aacccttagg 60840 

aatatccaat aattccattg ctggaacttc 60900 

ctgtgatatg tatgagctta ttcattgcag 60960 

atccatcaat ataataccgg ttaagtcagt 61020 

taagtcagtg catccaaaga ataaaatgta 61080 

ttgtgtaatg atagggacac accccaagat 61140 

acactgtgta taatttgctt acatttatgt 61200 

ctaccacgtg tgtaatatac aaacatacac 61260 

tcccagtgta cccatcgtaa attaaaaata 61320 

agcctaccaa acatcatagc ttagcctcgc 61380 

tagcttacag ctgggcaaaa tcatctaaca 61440 
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caaagcctat ttaatagtgt tcttgaatat 
aatgaaaagc agaatgattg tatgggtact 

5 

ctttcacacc ataataaagt tgaaaacttt 
cccaggctgg agcgcagtgg cgcagtcttg 
10 agcgattctc ccgcctcagc ctcccaagta 
gagctagttt ttgtattttt gtagagacag 
aactcctgac ctgtgatccg cccgtctcag 

15 

gccactgtgc ctggcccaca tggcattttt 
acccactcta ctccagtgtg acctcatctt 
20 caaatagagt cacattctga ggcactcagc 
999^99999 agg99*gcat aatttaacac 
aggcaaagat ttacatagaa ggatgtttat 

25 

aaataatctt acctgtccaa taactctgtc 
atctgagcag ccctgaaaaa acatatgttc 
30 catgatgaaa tattaagtta ataaaattca 
tttgaaaaga ataatgggag aatacaccaa 
gggagattac aggtagtttt tatttcattt 

35 

gaaaataaaa catttattaa aattaaaagg 
agaaacaaga cttcagcttg tggctctcct 
40 ctcacatcgg cctcggctcc atcatggcga 
cattctacgg aggctttccc ttctggggag 
ccgatgagct cccagaaggt gccaaggcac 

45 

tgccctgcag aaggttccca gccacccaaa 
cctcgttccg caagcatatt agtttggtag 
50 ttggatggct taaacaacag aaatttgtta 
atggagatgt tagcagatgt tttcccctaa 
cttctccctg agcctttcca cggccttcct 

55 

tgaggacacc agtcattttg gattagggtc 
cggttttaac aaccctatct ccaaacacag 
60 tcaatctacg aaatttggtg ggggatagag 



ctaatgtagt ttactgaaca ctggactgaa 61500 

ccaaatacgg tttctactga attcatgtca 61560 

gtttttttga gacagagtct cactctgtca 61620 

gctcactgca acctctgcct cctgggttta 61680 

gctgggatta caggcaccca ccatcatgcc 61740 

ggtttcacca tgttggccag gctggtcttg 61800 

cttcccaaag tgctgagatt acaggtgtga 61860 

tataaggaca ccaatcatat tggattggag 61920 

ggctgattat atctgcaaca actctgtttt 61980 

attagaattt caacacatga atttggagga 62040 

ataacaatat cctaaagaaa taagagatga 62100 

cacaatgtta catataatag aaaaaagtag 62160 

tcaactactc cacagaccat ccataatggg 62220 

caagaatact taatgccatg aggacacatt 62280 

gggggccatg ttgtgctatc aaggtatata 62340 

aatgtaaaca gttgttttct ccaggcggag 62400 

ttgtatttcc catattactt tttagaccat 62460 

aggctcatag gggaggcagg acccagcctc 62520 

gacaggccat ccagatcatc attggcctgg 62580 

cggttctcgt aggggaatac ctgtctattt 62640 

gcttgtgggt gagtaactca agtcctcctg 62700 

tcaaggccac cttgccaagt tgtgcagctg 62760 

tgtgggggat aggaaaccag ccagccctgg 62 820 

ggccggccca acagagtata gcaccacagg 62880 

tctcacagtt ctggcggctg aaagtctgcg 62 940 

ggtctctctc cttggcttgc agaaggtcgt 63000 

ccatgcctgt gccctagtct cctcttccta 63060 

ccccctgatg acctcattta acctgaatga 63120 

tcagattctg gagcactaca gattagggct 63180 

ggtgtggaca caattcagcc cataatatca 63240 
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agctacaacc ccgcaaggtg acatcccctc 
tttttttttt ttttttgaga tggagtctca 
5 caatctcggc tcaccgcaac ctccccctcc 
ccctagtagc tgggattaca gacgcgcacc 
tagtgacaag atttcaccac gttggccagg 

10 

cacttgcctc agcctcccaa agtgctggga 
aagggatgtg ttttcttaat tcacaaagaa 
15 tagcccctgg cattgttgga cataggccca 
gcaaaggtga agagtggctt cccagctgta 
ggggactcaa tctagatctg cagatgggct 

20 

tatgctaaat gtgtattatt tttctggaag 
gatccgtgac tccataaagt gttaagaact 
25 tttatggttc ctttgcagtg tccccctgcc 
cacacacata caaacaaaca cacatacaca 
ctctctcctt ctcctcccct ctccaccttc 

30 

cttcctcctg actcacacac ctacatgtga 
aactccagag taacctaggt gccagcccaa 
35 tagcctagag ctatctccat ccccgacagc 
tctagcgagg ctctgcacac acccaaactc 
ctcacaagat acctgttcct gtatctctga 

40 

ctagcatcac ggccacccag gcctaccatt 
attccaccaa gtgccagtga gcagcaggca 
45 ccacgcctcc tcccaccagg ctgctgagtc 
ctctgcaggg cagttgtgct ctttgctgcc 
ccaaagatca tcaatatcct cttctgaaac 

50 

gactgagagg gataccaaaa gtccagccct 
ggtgtgagcc tgagatttgt ttttccactc 
55 cagaaacagc tggaataagt tttcactgta 
tctgccgtgg gtgggaggga gtatggagtg 
ctgtctggat ttgattctca aatgcatgtg 

60 

aaactccgct ggatgatgtt gtgtgagttg 
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aaaagggatg 


tattttcttt 


ttactttttt 


63300 


ctctgtcgcc 


caagctggag 


cgcagtggtg 


63360 


cggattcaag 


caattctcct 


gcctcagcct 


63420 


accacgcccg gctaattttt gtatttttag 


63480 


ctggtctcaa 


actcctgacc 


tcaggtgatc 


63540 


ttacaggtgt 


gagccaccac 


acccggccaa 


63600 


aacatccaca 


tgtaaaaaat 


acactgggaa 


63660 


caagcctgag ctccaggaga 


aaaacactca 


63720 


gaacagcaag ggtttgagtg cttgagcact 


63780 


ttgggggctc 


cattttggca 


aaatacctaa 


63840 


ggtccatagc 


tttcatcaga 


ttatcagagt 


63900 


acagtccttc 


ttccacatca 


gtttattacc 


63960 


cacacacaca 


cacacacaca 


cacatacaca 


64020 


cacacacaca 


tacacacaca 


cactctctct 


64080 


ctttctcctt 


tctctccctt 


ctgttacagt 


64140 


cactaatgca 


actttcctca 


taaatggcac 


64200 


gtccctctac 


cttataagag 


ttaaataaaa 


64260 


cacacaagtt 


ctgcaaatga 


acagggctcc 


64320 


agaactcagc 


ccctttccct 


cccacaacca 


64380 


ccttcgtgga 


atgtggggct 


atggagaggg 


64440 


gagcacacaa 


tgcctaccac ggagcactgg 


64500 


gttgtcctgg 


catctcaaat 


gcctcactga 


64560 


ccttcagccc 


tcatgacctt 


cctccttttg 


64620 


ttcttgatcc 


caggactgtc 


tttctatttc 


64680 


aaaggctcat 


ctcctctggc 


acctcagaag 


64740 


tcctccccat 


caaatctcta 


taaacagaag 


64800 


ataggagaat 


atggccttgg 


ccttggaagg 


64860 


aataactatg 


tccactctat ggcacatgca 


64920 


agggttagga 


acatggagtc 


tgaaatcaag 


64980 


ccatagagtg 


gaaacactta 


tttacaatga 


65040 


ttcaaaatct 


ctaagcctca gtctcctcat 


65100 
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tgataaaatg gcaagagtaa tactgcttga 
ttgaacaagt acttcatctg ttggggggag 

5 

cttcttttgg ggccgttgca aatgttctaa 
ataaatatac taaaaataat tgcataatat 
10 aattatatgt caatgaagcc ttttctaaaa 
agctcctatt aaatatagct catgccctct 
tgatggcagg caatcccatc agccacagtg 

15 

cgcaattgac atcatcatcc tcgtatttga 
ctgagtaact aaaagttatt tgtccagata 
20 ctatctaaaa gcagcagaga agacgctggt 
aaacctaact gaatcttgat ataaagattg 
aaaatacaag gaaataatcc aacatttgag 

25 

ccagatgtag ggcataagtt gtctcatgac 
ccattttgta gatgaggaaa taaagactta 
30 gtttctagcc agaatttaaa accaggtttg 
ccagtgcctt ttcaggcagt gcaaaacagc 
tggttccagc aaaggtaccc actgtgtgtg 

35 

ccctagcccc tgacctcttt ctaaacccac 
tctctctccg tggcagcaga aaatcagcca 
40 agaccagctc ttccaactgg agacctatag 
catacgattc ctcccccagc cttgcaccta 
ttcccgctca tacagactgg ccacaggagc 

45 

caagaggatg cagaagtgag aaggggcaat 
ggagagcgag agagagagag agagagagag 
50 gagaggcaca gagctcccag gggcccaggg 
acttccagta cagagaagag gactccgggg 
ctaaggaatc cagccaacag aactctgata 

55 

agcaacagac caagaaatac cttcatttgt 
gaccaggtcc ctacaaccag atgagatcta 
60 gtggaagcgt atctttctgc acatggccca 



PCT7US01/48437 



tgggccacca 


tgtggttaaa tgaggcgagg 


65160 


ggcggtgatg 


ggttgctagt cggtacgtgg 


65220 


cttacattgt 


agtgatgatt gcacaacttt 


65280 


actttaaatg ggtgaattat atgatacatg 


65340 


aagcattttc 


tatagtgccc cacaaatggt 


65400 


cataccactc 


ttgctagccc actgggcaaa 


65460 


cctgcggaat 


accatgttgg atgaaatgca 


65520 


tgttcagcat 


catagcgaca caatgtttat 


65580 


tgcaggcagg 


caaggattga tgtcaggctt 


65640 


aaccccttcc 


ttctcccctc cagagccccc 


65700 


gaaaacaata gggtgtactt gtgggaaaat 


65760 


catctactaa 


gtgtaaagat atactaaatg 


65820 


aatgacactt 


gggaagagct ataaattttt 


65880 


gagaagttaa 


taacttaccc aaggtcaaga 


65940 


ctggatccta 


gaatatatat ttatcccacc 


66000 


tcattttttc 


tgttgcccat gttgagctga 


66060 


accagaatca 


cagaagcctg gggaagggca 


66120 


tctgttctgt 


accagtttat catttcagga 


66180 


tattcttatt 


gcctggtaag ttacattctg 


66240 


aatggtgtca 


caaaggggaa aataaggtca 


66300 


ctataaccag 


acacacggtg ctcacacaca 


66360 


ctggccctgg 


agtcacatgt ggaaaagtcc 


66420 


acagtcacct 


tagttaaggc atcagagtat 


66480 


ataacagctt 


tcctaagaag ccctgccaca 


66540 


ctgccctcta gcagaccagg agccaagagt 


66600 


aagccgaagg 


aggctcagtc actcgctttc 


66660 


atcaacatgg 


acaaagcttt gtgctcccct 


66720 


ctcataatca ggttacgaag ggtaaagggt 


66780 


tttgaaagtc 


ttcaaaagaa agctcaaaat 


66840 


gaatcatgga gcatctgagc ttatcacatc 


66900 
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ccaagacact tgggctaaga ttctatttat 
agagattatt cttttttctt tttctttttc 
5 tgtgtgagac agagtctcac tgtgtccccc 
cactgcagca tctgcctcct gggttcaagc 
gggattacag gtgcacacca ccatgcccag 
attttggtag agacggagtt ttgccatttt 
ggtgatccac ctgcctcagc ctcccaaagt 
15 cagccccaaa gattattctt gagaacactt 
catgaacttg ggaaaatctc tatgtattgt 
aaaggctctg agaagtccca ccacgatgat 

20 

caaagctaac ttaccccaaa acctttcttg 
tagggttcct gggattcaca ttaagaaaca 
25 tggtggcaat ggttgcacag cattgtagat 
aaaatgatta ggatgttaaa ttttatgtta 
ggggaaaaag aaacacttgt tgggtctcca 

30 

catcctgacc ctctgtgtct tcttcagctg 
gcaatctgct ctgcagttgg agtcatactc 
35 tatgcctacc ccgactatta tccttacgcc 
agatcctcta agttctgaat tagctacatt 
aagctacctc actgaacatg gtaggaaaag 

40 

aagtccttga cccatagctt taagcacatg 
agaacaatac ctaccttgtg gcaatgttga 
45 atatgtgccc accaaccagc acctctcctc 
agccagaggt tagcaaactt attctctaaa 
ggcctacagt ctctgtcatg tagctatagc 

50 

caaatgaata tggctgtgtt ctgataaaac 
ttgacccatg gatcataatt cacccacctc 
55 atgcaagcca catgttattt taaattttct 
aaagcaacag gtgaaattga ttttaataat 
attgtcattt caacatgtaa tcagtataaa 

60 

ctaaatcttt gaaatccagt ttatattttg 
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cctatgggac 


aaataggtca 


agtgtttccc 


66960 


tttttttttt 


tttttttttt 


tttgtgtgtg 


67020 


aggctggaat gcagtggttt gatctcaact 


67080 


aattctcctg 


cctcagcctc 


cctagtagct 


67140 


ctaattattg 


tattttttgt 


attttttggt 


67200 


ggccaggctg 


gtcttgaact 


cctgacctca 


67260 


gctgggatta 


cagatgtgag 


catccgcgtc 


67320 


ctcccatgat atgctcaatg aagatagtta 


67380 


ttcctctctg aaagatctaa agtcctcatt 


67440 


ctgtttacct 


ttgcccaacc 


cagctatttt 


67500 


cacagaacac 


ccctttactt 


gcagtggaaa 


67560 


ctttttgatg 


aaaagagtta 


tagagatgga 


67620 


gtgtttaatg 


ctactgaact 


gtaacacttg 


67680 


catatatgtt 


aacacaattt 


tagaaattgg 


67740 


ttcatctcag 


ctataacagc 


ttttctcgcc 


67800 


tctggcagtt 


tgggcttgaa 


catcgtcagt 


67860 


ttcatcacag 


atctaagtat 


tccccaccca 


67920 


tggggtgtgg 


tgagtatccc 


tctcaaccaa 


67980 


tagaaaactc 


ccagaaaggg 


cttgacaacc 


68040 


gaggtgcttt cagcatagga 


acaaatttta 


68100 


acctttccca gtttgggttt 


ctgcctttgt 


68160 


gagaattaaa 


tgcaatttta 


caaagaatgc 


68220 


acattctctg 


ccttcttgcc 


cattactata 


68280 


gagccaatag 


taagcagtta 


aggttttgca 


68340 


tgccttacag 


cagccacaga 


aagcactaaa 


68400 


tttatttaca 


aaacaggcag 


caggcaggat 


68460 


tgctatacac 


tagcactgtc 


caatagaaat 


68520 


agttgccaca tttaaaagag agagagagtg 


68580 


ctattttatt 


taaccccgta 


tatctaaaat 


68640 


aacttggttt 


gctattttgc 


atttggggaa 


68700 


tgctatggca 


catctcgatt 


tgggaccagc 


68760 
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tacatctcaa gtgtttaaaa gccacatggg 
tacagatgag ctacctgtgt cccacgtaaa 

5 

tcctctccta cctgctcaag ttctcctccc 
atcagtcttt ccccctttgc tggggctttt 
10 aaaaaacaaa ccaacccacc tcctcttcgt 
tctctggccc tctttgcaga caaattctta 
tcctcttctc tcatattctc tctgaacctg 

15 

aaacacctgc tcttctccaa atccccaatg 
ttgcttgact tccaggctcc accctcactg 
20 tctgtttttc ccaatctctt aggctcaata 
tttgtctatc tcatttagtc tggcaacttg 
atacatggct ccagccaaat cttctcccct 

25 

ctcaatagct catagttcat acatccaagt 
ctctactcag ttttcctgat tccagttgat 
30 aaaaccttgg agtcgtcatt gaccctgtct 
gcaatcctgt tggttctact ttcaaaatgc 
ctctactccc acgctggcct gagccaccgc 

35 

tgactgttcc ctttctccta ctctgggctc 
gggatccttt ggaaacacaa gatcccatca 
40 agccaatgtc cttccagcag cctgaagatg 
ctgctgccat cctctggttc tgctgctgaa 
cactccaggc atctttccac ctttgggcct 

45 

cctcagatgt tcccacagct tcctctgtcc 
caaccacctt cccaatgagg cctacctgaa 
50 cgcttccctt ccccttatct atctttcctg 
gacttatttg ttacatttca tgcttactgc 
ggatgttttt ggtttcactc actgtcatac 

55 

ggttgacact tcataagcat ttgtcaagga 
tcagtaaaca tcaacagcca ttaactgttg 
60 agagacttaa agacttatct ttccaagatt 



aattgtggtt gctgaataga acagggcagc 68820 

ttcaatcctc tgtctgcgca actgagtccc 68880 

ctctccctca tcccgtctct cctctacatc 68940 

gcatcagcat taagcatgct tttgtttctt 69000 

tgtacttcct ttatccacta ccactgtatc 69060 

aaagaattgt ctacacttta gtgagtccaa 69120 

ctccaattag actttcagcc tcacttccct 69180 

gggtgattga tccttcttct cggagggact 69240 

gttgccctct tctctgtctc ctctgctagt 693 00 

attggttctc tggttttctt tctgtactca 69360 

aaacaccttg tctatgacaa ttgttctaga 69420 

atacaccagt ctagcatgtc caatggtcaa 69480 

ctgatctcca gatttcactt cccaaacctt 69540 

ggcaactcca tccttccagt tgctcgggac 69600 

ttctctcaca cctcacagcc aatccctcgg 69660 

attcagaatc ccaccatttt cccccagccc 69720 

atctctcatc taatcatgga aacagcctcc 69780 

actgcagctg tttatagcat ggccaccaga 69840 

cttctctgct ggcatcttac tcagaaaaaa 69900 

ctggccctct ggtgcccccc gcccctggct 69960 

ccacgggaac ctccatgatg ttttctccaa 70020 

ggaatgttcc ctccgactgg ggaactcttc 70080 

cttcctcacc tccttcagat cactaatctg 70140 

caacccaaca cccacactct cccagtaacc 70200 

tagtctatgc atctaacacc ctctgcaact 70260 

ctgtccctag aatgagattc ttaaggacag 70320 

ctcaaatacc tagaactgtg tctggcacac 70380 

ataaatgaat gtgccagctg tgatgcatac 70440 

tttgtatttg accagatcct tttatcttca 70500 

cttaggcaaa gcctctccct agagatccta 70560 
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tggaccaagt ccatgtcatt tatcaaaagt 
ttatggctcc cagatacttt aggattaaag 
5 cctgggggat ctggctctat ctcactgtgc 
ctcaccctga accccagcta ctcccgtcca 
ggaattttcc tgctgggaaa gaaccaatca 

10 

taagaagcaa atgctggtgg aggaactaag 
agactaaacc ccaagcagca cagccaggtg 
15 ttcagaaaac ttggctttta atcttagttc 
gccccatgcc caccacccat tccaccttca 
gtcggactgc atctgagctt tgcaaataat 

20 

ccacatcagg aattattttt catcaaataa 
ttccatgaaa tattttatgt gtaaaaaaat 
25 atactttgcc caggccattc tactctaagt 
actccccttt tcctgggaca actgcctcta 
atgaacctct ccctcctcag tgctctggga 

30 

tatctgatgt gaattataat tatcatctgc 
ggatgcattt cttagccgct tctaccaagc 
35 aggaaggaga agaaaagggg ctaaggagag 
tctggctgtc accaatgctg tttttaaaaa 
gtggctcaca cctgtaatcc cagcactttg 

40 

caggagttca agaccagcct gggcaacatg 
aattagccag gcatggtagc gcgtgcctgt 
45 agaatcgctt caacccagga ggcggagctc 
acctgagtgg cagagcaaca ctctgtctca 
gagaggcatt tctttggtac ctactgcatg 

50 

gtcacataat tggaacccaa atggtaatgg 
actcgcttcc cagatacact ggaagactcc 
55 atggagggtt tctatttctc ccggttacct 
gatcctttca ctgactgttc ttcccaagcc 
aaaacaccac tgtcacaagc caagaaagtc 

60 

cagatgaagg ggtgggttgc cccttcacac 
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accttgagtt tctcccctcc cagaaacatt 70620 
tccaaattcc tcagccaggt atttgccctc 70680 
cctacagttc acctcccttc gaatctctca 70740 
ctgcgagaga caggcacaag cgagtacttg 70800 
gggaggcatg gactgcccaa tagaagaaat 70860 
ggaaaggaag atgggagaag agtttaggaa 70 920 
ccctggagta gagggagcag caatttttga 70980 
tgccaatgat tgatcttgga atgagaactc 71040 
agcctcagtc atcccatctg aacatggagg 71100 
gacccaagga atggagatac tccggggttt 71160 
aaaaaatctc agctgcttga gggacccctt 71220 
tttatttaaa aaaaattgaa aaagaccagc 71280 
ctctgtgatc cgttgactta gacatcctgc 71340 
ccctcaccct ctctaaatcc cttcatgaca 71400 
tacctttttg ctgttctgta gctcttaccc 71460 
atctgttttc ctactagatt gtgcctctca 71520 
agatgtttaa taaccacttg ctaaatggag 71580 
gcctctccag aagagagaat tctccaagca 71640 
tcataattga gataattgag gccggtcacg 71700 
ggagaccgag gcaggtggat cacttgagct 71760 
gcgaaatccc atctatacaa aatatacaaa 71820 
aatcccagct actcgggagg ctgaggcagg 71880 
acaatgctga gctcatgcca ctgctctcca 71940 
aaaaaaaaaa aaaattaaaa aatcataatt 72000 
ccatccacct tcactatgaa atgttctcat 72060 
gagtctcctg gatggacctt ttagattgaa 72120 
atactcctgg gtattcttgg gtaacagcaa 72180 
cctgctatgg ttaatttact tcattttggg 72240 
agagaaacag ccccaattcc tgccctccac 72300 
ttttcagagg ctgacatttt cccaaaggtt 72360 
ctgtgggtgt ttctcgttag gtggaacaag 72420 
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agacttggaa aagaaagaga cacagagata 
ggaccggcgt tcagcataca gaggatccac 

5 

tttattgatc attattgggc atgggtaata 
aacacgtgaa caaatgtctc tgcatcataa 
10 gatgtgcata tacataaaca tctcaatgcc 
cacctccagc cctaaggcag ttttccccta 
acaccgagac attccattgc ccagagacga 

15 

actgcaaaga gccgttcctt cctcttttac 
tgttgggctg ggggacggtc aggtctttcc 
20 catggggaga aaccttggaa aatacctggc 
gcagtgtttt gtgtctctgg gtacttgaga 
gatgactctt tttttttttt tttttttttt 

25 

ggccggactg cggactgcag tggcgcaatc 
tcatgccatt ctcctgcctc agcctcccga 
30 gcccggctaa ttttttgtat ttttagtaga 
tctcgatctc ctgacctcat gatccacccg 
gcgtgagcca ccgcgcccgg ccggtgatga 

35 

tgtttaacaa agcacaccct gcacaaccct 
catgtttcag ggagcacagg gttgggggta 
40 aagaattttt cttagtacag aacaaaatgg 
acacagtaac aatctgatct ctctttcttt 
acatcagagc ccagcaaacc actgattact 

45 

ttcatttttc acaggaggcc ttttcctgtg 
ccttcttttt cctctaacca tctccctgtt 
50 cctaccgacg gcctccccag agcagtgtca 
ctcaggtctc cacacccaaa gcaacctatc 
ttctcagagg aacactacca agtaaatgtt 

55 

atacttacaa aaacccggtg ggaaatttcc 
tccgcaaagc cacaaagagt ccatggcagg 
60 gtcccgtgct cctgtcctcc ccatctgctc 
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aagtacagag 


aaagaaaaag 


ggggcctagg 


72480 


gtgggcactg 


gcctctgagt 


tcccttagta 


72540 


ggataatagt 


ggagagaagg 


tcagaaggta 


72600 


acaaggtaaa 


gaaaaaagtg 


ctgtgctttt 


72660 


ttaaggagca 


gtattgctgc 


cagcatgtcc 


72720 


tctcagtaaa 


tggaatatac 


aatggacttt 


72780 


gcaggagaca 


gaagctttcc 


tcttacctca 


72840 


taatcctcct 


cagcacagac 


cctttacggg 


72900 


catcccatga 


ggccatattt 


cagactatca 


72960 


tttcctaggc 


agaggtctct 


gtggccttcc 


73020 


ttagggagtg 


gtttgagatt 


agggagtggt 


73080 


ttttgagacg 


gagtctcgct 


ctgtcgccca 


73140 


tcggctcact 


gcaagctcca 


cttcccgggt 


73200 


gtagctggga 


ctacaggcac 


ccgccaccgc 


73260 


gacagggttt 


caccttgtta 


gccaggatgg 


73320 


cctcggcctc 


ccaaagtgct 


gggattacag 


73380 


ctcttaacaa 


gcgtgctgcc 


ttcaagcatt 


73440 


taatccatct 


aaccctgact 


tgacacagca 


73500 


gggttacaga 


ttaacagcag 


ctcgaggcag 


73560 


agtctcctat 


gtctacttcc 


ttctacacag 


73620 


tccccacatt 


cagagggatg 


gtggaggttt 


73680 


gaagataaga 


catccttgct 


ttgggaagtc 


73740 


gcctccccct 


tcccaccatg 


gcgctcaact 


73800 


ctctctgtgt 


cttgtccata 


ccctccaatt 


73860 


ctgctcccct 


ctctgactcc 


agagcacacc 


73920 


aaggggcttc 


cctaaacatg 


gtcatctttt 


73980 


gcaataacaa 


ttcttttacc 


cccatacttg 


74040 


ccccacgagg 


ctgagagagg 


ataggggact 


74100 


gccccagtga 


gaggtcagtt 


cggactccca 


74160 


tgcctacaga 


accctggaat 


ggcgatttct 


74220 
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ggcgtgctgc tggtcttctg cctcctggag 
ggctgccagt tggtctgctg tcaatcaagc 
5 gaaagatgcc cccaaaaagg tggagacaag 
aggagcagga gtagttcctc ggtgtccgag 
atgccccctg tgtgcctgtg ttttccaggt 

10 

aaacccagtg atcaccccag aaccggtgac 
agcaaataag taaggctaca gattctggaa 
15 cctccctttc tgggcttcca taacccaggt 
tctcccactg tttgtactct caccttcatt 
tctctatcaa gaagaagaca gagattttaa 

20 

gcacatgcat cagcacatat gtgggcatcc 
gtgtgctctg ctgcatgtga gcttgtgggt 
25 tcactctttc aattgtgcat tcatttaata 
gctctcttca tggcaggagg gggctggaca 
acttctggcc ttaggagaga ggacaccagg 

30 

ggcaacatag gggagaggca gaaggagccc 
aaggaaggaa tgaggagagc agaggacagg 
35 cattaaagga cagtaaggat tttccgtgta 
agagagaatg atcagtcctt ttagattctg 
tgcaagaatg tagaaggaat cttccagaga 

40 

ggactaattc caaagagaaa tggccatgtc 
gtaacagcat ctaaaaggac taatcctttc 
45 ccaaacagcg tcgtccctca tactgaaaat 
tgtcgcttcc tgaaacacgc aaaatttaaa 
caagaagttg cattctgtgg ggttatagca 

50 

tttccttagc tacagaagca ctgcaaggat 
aatacaaaat gtgattttgc tcttgtaaat 
55 tcttctaaat ggcatgcagt tagaggtcta 
tctgatcagg aatccttctg atgcccatca 
ctagaactaa atgggcctta ggaattacag 

60 

aatatgaggc atgtaatatc ctggttctag 



PCT/US01/48437 

tttggcatcg catgcgcatc ttcccacttt 74280 

aatgtgagtc ccagggttcc tcagtgggtg 74340 

ggagctcttt gtgctttcca ctgcctgctc 74400 

gcctttggtg catggcccgt ccttcttagc 74460 

gagtgtcatc tatccaaaca tctatgcagc 74520 

ctcaccacca agttattcca gtgagatcca 74580 

gcatctttca ctgggaccaa aagaagtcct 74640 

cgttcctgtt ctgacagctg aggaaacgtc 74700 

cttcaattca gtctaggaaa ccatgctgtt 74760 

acagatgtta accaagaggg actccctagg 74820 

agcctctggg gccttggcac acacacattc 74 880 

tagaggaaca aatatctaga cattcaatct 74940 

aatagatact gagcattcaa tgtgttcaag 75000 

ttgacatgac acagtcttgg acctcaagaa 75060 

acatcacggg agcaattttc actgacaaag 75120 

agagtgaatt tattctggct gtgatgcaga 75180 

tctggttggg gtagcaacat ctgaatgggg 75240 

ataagggacc acgtggttca ggaggccttc 753 00 

ttaccaatga ggcaaccaaa ttcacgaatg 75360 

cataaggcca atacatcatt atctgtaaaa .75420 

attcctatta ccatttctgc atttaccgtg 75480 

agcagacgac tcactgtctc caaaggtctc 75540 

ccctcactgt cctttaatgg cccatttaga 75600 

cattgttatg tatgataacc aatgggcatc 75660 

atcatctcta aaatccagct gttgtcaagc 75720 

ttgtttgttt gttgtttgtt ttggttacta 75780 

aactttgttc ttggggttat ctaagaagtt 75840 

tcgcctgtgc acctgcatct tctcattttc 75900 

gctagccagg aacaggagtg ttttcatttc 75960 

acctcctgaa aaattgattt tttcaaatgg 76020 

atcagtcctt cacatacctg ttttgtaagc 76080 
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caggaggtta ggacattgag ctccccaaaa 
ggctcggtaa acagagcaag cctgcccctc 

5 

gctgccccat ctaactccac acctccttct 
accctccatt ctgttcacag agcctccttc 
10 atattgctaa catagctcgc ttcctgcagt 
tacatctggt gcctctccat atgggggcac 
gcttgttgga acagattctt tcctatctta 

15 

cccttgtcct ggggggacca attgccaacc 
tgcattctct gcaccacccc acaagccctc 
20 actccataac aatggaacat aaaacccgga 
ctatttaacc atagcactgt ggcagtgcaa 
tgcaatttaa ttgccattgt aatggtattt 

25 

cataaggcct ctgccctcat gaatggagta 
cacaggagtg ggcttctgat agaaggataa 
30 gtgctcattt gcccttcagc catgttatga 
catgctcttg gacttcccag cctccagaac 
ttacccagtc tctggtattc tgttatagca 

35 

gaaatttgcc caaggtcatc aacccagcta 
cagatggccg gttcagggct ccttgcagaa 
40 tgactaagat taaaacactg tcttcacttc 
gtgccacttc accagccaga gacttccgca 
atgactgctg agctggctcc tcccacttgg 

45 

agcccggatc tcacacccac catgggcaag 
caagtgagtg agagtggggt ctggccatgg 
50 ggcagctcca ggggccagca caggcacaag 
tgccttaagt ggcttccacc ttgagcactg 
gaacacttag agatgccagc aactgcagag 

55 

tctggctcag ggagtaccaa gatctgggct 
tccggcaaac gagagcatgt caccacccat 
60 agcttggttg ttttacagct tgtttggtcc 



gggacccacc tgtttgcacc gttctctgag 76140 

tcaccacctc ctttcttatt gctgcctctt 76200 

catcctacag aactctaccc ctcaaacctg 76260 

ctgacatttc tgacctatga gttaattcag 76320 

gtgggtcatc ccagagaaag cactttctaa 763 80 

ctgaccctgc cctcctgaaa tcaccacagg 76440 

tcacctacta tattagtcct gtgttctagg 76500 

tagggtgaca gctcattgca taaaatgtta 76560 

cacccacagg gctccatccc ctccctatcc 76620 

agagatctca gagatctgta gcccacccaa 76680 

tggtttaaat ctcccctcca aaactcatgt 76740 

aaaagtggag tctttaaaag atgattagat 76800 

atgccattat tttgggagtg ggttagttat 76860 

attttggtcc ttgtcctctc tccgcctcat 76920 

tgcatcaggg ccctcagcag atactggcac 76980 

tgtgagaaat aaatttattt tctttatgaa 77040 

atgtaaaaca gactaagaca ggtagagatg 77100 

gaggtatagc caggtccaga accaggactt 77160 

ggaggagcat gtggtaaaaa aaaatcaggg 77220 

cttagcttga aatgacacag gatccttcgg 77280 

gccagcagca cctctgctag agttttgctc 77340 

cccagcaggc tgcgctcagc tcatgctacc 77400 

ttaggcatgg agcagaaagg ggtgtgtgag 77460 

tacacctgtg ctggctgctg tggtggggca 77520 

ctctgtgcaa ggctatggct ggacaaggca 77580 

gcatttggac aaggggaaca catggtgcct 77640 

ccccaaatgg gtgttacagc atgtcacagc 77700 

cccataaggg tcacagctct tctctcatag 77760 

agctcggcaa tctggccaga aacgtgtttc 77820 

tgctgccctg ctccagccca cagttcctgg 77880 
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gctggcctgg ccctgccctg cttcccatca 
gcaggaggga tacaatgtta cagctccttt 
5 ttgtcctgtg tccaggaaga atgaggttac 
aagagtttta ctgagcaata aaacagctct 
cctacccaga ggtgggtagt cccacgtggc 

10 

cccagactta ctgaattagc actctgggga 
ctgctgggtg attctgatgc acactctcag 
15 tcaaatgact gtttttctca ttttgagtac 
tgtttacaga ggcagcccat gccatctgta 
atgaactatg tcaaaacgga atgaattaca 

20 

aaaagatgac ccttcacctt tagacttgcg 
agcccaacca cttgctgaca gtctctggta 
25 ccttctcccc cttctcaccc cagcaaggct 
agttaaatca acaaccagga gtcaatacaa 
aaagcagcaa cattcgactt cacccaggta 

30 

ccagtgggcc aatccacaga ctgccagaag 
ctactgaaca ctgccagcgt gctcgcacgg 
35 atgaacacaa tcttatgaaa atggaagtaa 
aacttattct gttcaatgcc ctttgataaa 
gagtttaact gagcaatgaa caattcacaa 

40 

ctcagagact ccatgagtgc ctcgtgttca 
atgtactgga attggaagtg aggtacagaa 
45 ccttagttga aggcagcttg aatagtcagt 
ggattggcca atgctcagct attgttactg 
tgtctgacta ttaagctagg ttacagttca 

50 

gtccttctca ggccatattt agtttgcttt 

* 

tattgcaaaa atacagagtg gcattatcta 
55 ggggctcaga gaggttaagt aatttgccca 
ggtcttaatt catgtctcct cactccaaat 
ctctcacagt tctggaggct gggaagccca 

60 

gtgaggggct gcttcctggt tcgtagacag 
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tgcagagcag ctgcccagcg tcagtagaag 77940 

cgcagctgcc attccacggg tgctgggttt 78000 

atggacaacc agagagtgag caaggcagag 7 8060 

tagtggagaa gggccccaaa gcaggtagcc 78120 

tgagtccaga ggttttatgg gctttacgat 78180 

caggatccag caatctgttt ttgaacaagc 78240 

gtttgagaac cactgctcca cagcatcttc 78300 

cttctaacgt gatggcatac attatgatgg 78360 

gacaattgta ctgagaggct cttcttaaga 78420 

tcaagttttc atgtggaaga ttaaaaaaaa 78480 

gaatcagagt attagataag actctgtctc 78540 

ggttacgtcc cccatgttcc ctttggtcaa 78600 

agtatcaggt acacttttta tgtctcataa 78660 

actatataga tacagagggc aataaaggca 78720 

agagggtaat aaaattccat aaaggacaac 78780 

catcaggcaa ctgagcactc cccacctcca 78840 

gactggtagt gctcaataaa tgtctgtcaa 78900 

ttaacattta ttgagtcttt aatgtctgac 78960 

agaaaaacct cagccgaatt aaatttaaag 7 9020 

atcgggaagc ccccagaatc acagcagatt 79080 

gaacaaattt atacacaaaa aggaaacaca 79140 

acagtgagat tggttatagc tccgcatttg 79200 

ggtctatgaa tggttgaagt atggccgctg 79260 

atgcatacta ttaagttagg ttttcgattt 79320 

tccacaagga ctcaaatata gaagtacaga 79380 

aacacctttc agctgcatga tctcattttg 79440 

tgctttagga aagagaaata tgaaaaatct 79500 

agtctgcact gctagtaaat gatagagctg 79560 

cccatgctct tttgccactg caaaatgtat 79620 

agaccaaagt accagcagat ttggtgtctg 79680 

cagtgctctc actatgtcct cacatggcag 79740 
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aggggccaag ggagctctct ggagtctttt 
gcccctcatg acgtaatcac ctcccaaaaa 

5 

ctaggtttta acataggaat tttaggatga 
aaaaagctta ttgatacagc ttcagattcc 
10 cttgagttcg gtgtatcagg ggttcttaag 
ggatgtattt tttgttgcca caactggagt 
ggtagaagcc aaggataatg ctgaacatcc 

15 

gaattgtcta gcccaaagtg gcattaatgt 
tatctgaaaa ggctattaaa atactcactt 
20 acttcaacca aaacaacata tcaaaacaga 
ctcttccagt aatccaaaca ggaaagaggt 
gccacaaaat gtagtgtcct agcaacggtg 

25 

cctttcccta ccacttcttc cctgaagcaa 
cccaaccctt tgggaggctg aggcaggagg 
30 gtgagctatg attgtatcac tacattacag 
aaaaaaaaaa aaaaaagtag aaaagaaaag 
tccttatctc tgaataaatg tggttacaaa 

35 

tcatcccagt ttattaccat taagtcatgg 
cacgtcctca tcaaacctag cataaaaagt 
40 atttccttat gaagaccatc acatcaagca 
tctcttgtta atctgtcttt tgtcaaaagg 
gaaaaggtat ttctcctctc ctatgaacct 

45 

tttttcattt tggaaaatag aattattttt 
aaatatttta ttttaaaata gattagtaaa 
50 aatatggtat aaatatcaac aaatttaacc 
aataactttt aaaaatgcaa aggactcctg 
gtttatgctg ttattctcct cctcaaagcc 

55 

attcctgacc atggcattca agaccctgtt 
gtccaaatcc cagttgtcca gcctctctgt 
60 caggcatcca ttcactcatt catcaaatgc 



attatataaa gccactaatc ttattcatga 79800 
acccacctcc aaatatcatt acattgagga 79860 
caaaaacgtt cagtctatag cacagagtac 79920 
acactgcaag taacctttcg ggaactaaca 79980 
aagcgggaag aggtgaattt ggtaacgtct 80040 
tgagggaaag ggtgctattg gcatctagtg 80100 
tacaatgcac aggacagccc cttcatcaaa 80160 
tgagggctaa gaaagaagac taccagtaat 80220 
ttctaactac cgatctgatt ttcttcatat 80280 
caatatataa gcagatgtga aaatctagct 80340 
atacatacct gatatggttc aggtcacaca 80400 
aaaacagcag aaacaggaaa gtcatgctca 80460 
ctcataagac cctaattcga aaggtaccct 8052 0 
atcacctgag gccaggagtt cgaggctgca 80580 
cctagatgac agaatgggag ctcaaagaca 80640 
tgccctccct acaccgagag gaaaggaaca 80700 
atgatattaa acaaataggc cttgctaatt 80760 
ttttttattt aatcatactt cacaattatt 80820 
acacaagttt atctgtttct tcaggtcttt 80880 
aaacttatat taaataaatg tgtatgcttt 80940 
cccttggcca taaatgtagt gatggttgaa 81000 
ataaaacaat accactcttc ccactaattt 81060 
tatttaaaaa tatttatgtt aatatgtaat 81120 
tcatttttaa gtatctccat tttaatttct 81180 
cacataaaca agtgttcttc gagagtactt 81240 
agaccaaaga gtgtgggtcc tgagaccaga 81300 
ctgcaagggt tcctagcatc cacagaataa 81360 
cagctgccct tcaatccctc ttcccacaat 81420 
caatggctcc taccttgaac tcaccaaatt 81480 
caagcacctg cctctgcagg taagaaccac 81540 
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atcccacccc ttaggagctc agaagctcat 
gtggggcaag gatgcaggag aggttccgac 
5 ctttgggagg ccaaggcagg tggatagtct 
ggtctctacc aaaagtaaaa aatgggtctg 
aacctgtaat cccagctacc tgggaggctc 

10 

ggaggttgca gtgaaccgag attgtgccac 
atgagggcga atgaagctcc ttctcaaaaa 
15 ctgtcaaggg acaaagcact tatttttttc 
tgtttttggc tcttttgggg tatgggaaat 
aggttgggga gggtaaaaca gagaattgtt 

20 

ttctgagggc agctctgcag gggccggagg 
gagtggagag gaagaggatg gggcctagaa 
25 aaagggccca gctgaagttc tctgtgacag 
ccagcacttg atgtttagtt cattgctctg 
tacactttag taaaattccg ctacacagaa 

30 

tcaaggaagg gagagttttc tagtgctgcc 
ttaattcaac gtatcgtctc acagatctgg 
35 agggtgggct ccttctgggg ctgtgaagga 
gtggtccgtg ggcatcttgg ccactccttg 
ttcatgttca tgcaggcttc ttcctgtgtg 

40 

aacaccagtc atagtaaatt agggcccctt 
ctccgaataa agtcacattc tgagatacta 
45 ttggagggtt ccatttcaac ccacagcagg 
gtggacaaac agccgtaaaa ggggagatgt 
gcaaggaaat tgactatgta aagcgtaatc 

50 

atggagctac tatatatagg atggcggctt 
tgattctata aggctggaaa aattggagac 
55 acatttgggg catgtgggtt ccaaagagat 
ttgtggagga agaggcttca ggatgagcgt 
cacacctctg gttcacactc ttctggaata 

60 

tctttctttc aatgtccaac ttgagttaga 
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aaggaaggat acacacatac tcggataaca 81600 

aaaacgctat gggagcctgt aatcccagca 81660 

taggtcagtc tggccgacat ggtgaaacct 81720 

gtgtggtggc agccacctgt aatcccagct 81780 

aggcaggaga atcgtttgaa cccagaaggc 81840 

tgcactccag cctgggtgac atgagggtga 81900 

aaaaaaggta tgagagtttg gagtaggatg 81960 

aagtggcggc aggtttcctt aggaaataac 82020 

gaagccctcc tcccatccca agaaatgacc 82080 

ctgcacaata ttgcaggaag catctggaaa 82140 

acgaggctct aggatgggaa gaaaaggaaa 82200 

gcatggagga aggccagggt cctggagaag 82260 

gaacgagcaa gggactaaga tcagagacat 82320 

ttttggggca ttggcatctt tcccaccccc 82380 

ccactttgtg tgagtaaagc tgtaggaaaa 82440 

ataacaaagg actacaaact gggtggcttg 82500 

agactggaag tctgagatca aggtgtcagc 82560 

ctccattcca gacctccctg caggcttctg 82620 

gcctatagat gcatcactcc agagtctgcc 82680 

caagtctgtg caatatcccc tttttacaaa 82740 

ttaacctaat tacttttgcc atgaaactat 82800 

gatgaggggt taagacttca gtatatcttt 82860 

agatgtgttc caggactgag ttcatgtggg 82920 

gagggagatc aggagtgtat aggaaattga 82980 

tacatgaaga aatcctccaa cttattggga 8304 0 

tagccagttc atgtaaacta aaggaaaggg 83100 

acacacacac acaaacatca caggcagctc 83160 

ttagtttgac tgtaaagttc agggaaggct 8322 0 

tggaggatgg ggtgaatctg gacatgctcc 83280 

cactgtgcct tcagtgcagc tttgatccag 83340 

tgtccaattc catgtccagc cactcgacaa 83400 
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acctcctagg accagcccat ccctttgaca 
tgactgaata tttgtttgtt cctatccata 

5 

tattctgtct cgtcaactcc ttgcaaattc 
agcacacacc agacactcaa taaatgctca 
10 cagaaaatct tgacccactt ctggatttct 
atcccatgga atcctacttt agccaaggcc 
tggatgccag agaatagacg tattgactcc 

15 

tgaaagggaa acctaatttg agtttatttt 
catcagctag caccagaatt tctagatcag 
20 agcatccaaa gacttcccag gagcacaagg 
ctgacagagc ctggatagca ccagcagaga 
tcttgagaac cacaaggaat caggggttca 

25 

gcgtgcagcc acccatcctt ggaggtgctg 
caggttctca aattgagctc ctgtctctgc 
30 ggagactctg ctaggcattt tctggctgcc 
gtgatcaggt ctacttagga cctcaatcta 
ctcttttttt tcttcttctt ctttttgaga 

35 

tgcagtggcg cgatctcggc tcactgcaag 
gcctcagctt ccagagtagc tgggactaca 
40 tgtatttttt agtagagaca gggtttcacc 
cctcgtgatc tgcccgcctt ggcctcccaa 
tttgttcctg acattttcct gacgggaggt 

45 

atagggccca actgtttgtg ccaattgtca 
aaagaaggag tcctctgaaa aaaaggaaga 
50 gcagagtata acatttagca cagtattcat 
ttcttgtatt tcaaatctca acttcagcat 
ttaacctctc tgagcctcag agagtttgcc 

55 

tgggggaact gccttatatc tacagggtta 
tgttattcta aaceactctc ctgggaaatc 
60 aggtctaatg cagaagtggc tcacactgcc 



catctcccag ccacatcatc cctgatggtc 83460 

cttgcactag agttatccca acagtggtta 83520 

tagagcacct cgtatcctag cataccacct 83580 

ctaattgctt ataagggaaa tctattggct 83640 

cactttctca aagagaaact ttggaaggag 83700 

ataaatactc tgaggggcaa ataaacagcg 83760 

aaatgaaact gcaaagagga ctttggagct 83820 

ctagatgatt aacattaaat agtgccaatc 83 880 

gctgcagaaa cccacagagc ctgaagaaag 83 94 0 

taagacttct gtgaattctt ccgcaagact 84000 

cgtggaccta ggcggtccct cggaagtgtc 84060 

gacatagggt agtgccctcc agcaagacag 84120 

ggaggctggg aagtaactga agagtcacaa 84180 

atggacacag caatcttaag atagaaggct 84240 

ttgctcccta agcccagctg taggcacact 84300 

gaactggtgg aatggcttct gatggaacca 84360 

cggagtctcg ctctgtcacc caggctggag 84420 

ttccacctcc cggtttcagg ccattctcct 84480 

ggcgcccgcc accacgcccg gctaattttt 84540 

gtgttaacca ggatggtctc gatctcctga 84600 

agtgctggga ttacaggcag aaccattctc 84660 

agaatctcag gggtcccagg aacatccctc 84720 

ggtctacggc tgtattgttt ttcacacatt 84780 

gagaaatgat taatttttta atatcatgaa 84840 

tatgaaaatt gaaggcaggc agactatcaa 84900 

ttcctagctg tgtgactatg gtcaaggcac 84960 

cgagaaatgg tcataacaat aatatattgc 85020 

tgaggcatca acatccaagg atgctgtagt 85080 

tgttattctt caagaaagca aggaatgtca 85140 

tgcacagcct ccattcatgc agctgaatgt 85200 
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tccaccatcc atatgtcact ggtagaacat 
gcaaatagca gtcagaaaaa aataaatttg 
5 attgtagaca agttattcca ctttcagagc 
tcggatcaaa atgagtgaga agacaggggt 
tgacagttgt ttttcctatg gaagagtagc 

10 

tttaaaataa ggaaaaagct gggaattttc 
gttgggatcc aaggaaaaag aagtgaaacc 
15 ggaagtggtt ctcaaacttc agcagcatga 
atggctgatc cttgcccctg gactttctag 
tttccgacaa gttcccggtg atactgatac 

20 

tctgctatta atactaaatc taatagtttg 
ggcccctctc ttttttaaga ttggggtaac 
25 tattttattt tattttattt tattttattt 
acgtgcaggc ttgttacata ggtatacatg 
cgtcatctag gttttaagcc ccacatgcat 

30 

ttgcccccca accctccgac aggccccagt 
ttctcattgt ttaactccca cttatgtttg 
35 atgatggttt ccagcttcat ccacgtccct 
gctgcatagt attccatggt gtatatgtgc 
gggcatttgg gttggttcca agactttgct 

40 

ctacctttca acaaagaaca aatgttaact 
gttttctcaa tcttcaaaac accctttgga 
45 ataagaaact gacagcgtat gcaacttgct 
tgagaagtgg acttagggct gtctgagctc 
tactttatag acgattaata ttaaataatg 

50 

tgtccattga caatttggag tgtttctctc 
atttatactt gtgatttagt gaggacagaa 
55 gcacacgtta ttttgtgcca taggatttgg 
gtaaagggat aagcccacca gccacaacaa 
ctctcatctg aatgggatat cttcgcatct 

60 

tgcctacctc ctcccattgc acacttgctc 
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ttcttctggg aaatgaaaac atagtgaaat 85260 
agtctcagat ctattcctgt gggttgcata 85320 
cttggtctcc ctactttgca aagaggcggg 85380 
gaatgcctta gccatcaggt gtcagaccag 8544 0 
atggaaaggg gcctcagaga ccagctagat 85500 
tgaagcttta tataaccaat attttgaact 85560 
acaaaggaga agtgtagggc atttctgcat 85620 
gattcacctg gagggtgtgt tgaaacacag 85680 
tagctctggg gaggggacca agaatttgac 85740 
tgctagtcca gagaccgtac tttgagatcc 85800 
gggtcaatta ggaaaaatat tcacaacaat 85860 
acttttattt tattttattt tattttattt 85920 
ttactctaag ttctgggata catgtgcaga 85980 
tgctatggtg gtttgctgca tctatcaacc 86040 
taggtatttg tcctaatgct ctccctcccc 86100 
gtgtaatgtt cccctccctg tgtccatgtg 86160 
gttttctgtt cctgtgttag tttgctgaga 86220 
gcaaaggaca tgaactcatt cttttttatg 86280 
cacgttttct ttatccagtc tattgttgag 86340 
aaacaatggc ccttctctta tgggtggtct 86400 
gttccctctg tgccaggttc tatgctggca 86460 
gaaggtgcca ctattatgca ccttttatag 86520 
caaggtcgta caactaataa atgaaggaac 86580 
ttaaccctga tactgtcttg ctttcgagtt 86640 
ccaattcatc agctagcacc aaactcaaaa 86700 
actcaggatg cagctgggac agggggatca 86760 
agcctcctga gaacatacct ctggggtcct 86820 
tgtcttgcca gtaaggactg tataatctaa 86880 
ttaacaagta aagtacacta gtaa.cattat 86940 
aataatatct ttaatgttct gagggctaca 87000 
cccagcatca aaaaattaag caattctcta 87060 

-323- 



WO 02/062946 



PCT/US01/48437 



agtccagctt atcagtcctc atcaaatcgg 
tcccttctca actttttcta atgctcaaga 

5 

tgatcactgt cataatctct aggaccctgg 
aacaaaattt ctacatggtc gcaatcccaa 
10 tgaaagggat gatcattccc aggccacttg 
cccaagaagc ctgggaaaga cagtctgttt 
atcatacaca aaatgcctta attaaattaa 

15 

tcaaggacag aagtacagct ggggcaaatg 
caaagaccaa ggtagtatct gtgtgtatgt 
20 gtgtgtatct gtgtgtgtgt ctgtgtgtct 
cctccttccc tctctctgtg tgtgtgtatg 
tccctccttc cctctctctg tgtgtgtgag 

25 

cctccctcct tccctctttc tctgcgtgtg 
ttctccttcc ctctcgctct ctctttctct 
30 ccttctctcc tcttctttta tctctccctc 
ctctactaac tggtgttctc tgctccccct 
ctgcctagcc tcccatccca cagtagtcct 

35 

aagttaccag agcatttggg gtcagttgtt 
agggggcaag atggggaggt agcctgtgcc 
40 aatgccaact gcagggggct aagcaaagcc 
agaagctaag tagcctcggg aaaccctagt 
agtgtgttct tgccaagaaa ggcaagaagt 

45 

cttattttaa tgaagatgaa actaaagata 
tcagaagagg gaaatgcacc ttcttatgtg 
50 aggtgattgg agccaacagt gtacctggca 
agcccagcaa ccctgtggcc tctggaaatc 
caatctcacc aaaagtaatc cactgtgata 

55 

tggtgaacca gaactcagca gcaggtgtac 
atccagtggg aacagccagt ttgcagacgg 
60 ccacgaatct ccagacatgg cctggagacc 



gacaatatcc caggtctccc cccaagtgat 87120 

ttcagtagaa actgcaggct attaagggtt 87180 

accataatgg tttaaacaca aggaaaatgc 87240 

caccaacatg gacaagaccc atgaagagat 87300 

gcccatggtt cctcctggat gtcaagggca 87360 

ttttgctctg aaaccaatta atgttgaaaa 87420 

acccataaat gagttaacta gttatagact 87480 

agaacctaga accgagagtc aaaaggacat 87540 

gtgtatgtat gtgtgtgtgt gtgtgtgtgt 87600 

gtgtctatgt ctgtgttatc tgtcttcctc 87660 

tgtgtgtgtg tgtctgtgta tctgtcttcc 87720 

tgtgtgtctg tctgtctgtg tctgtgtctt 87780 

tgtctgtctg tctgtgtcta tgtgtcttct 87840 

ccttccccct tttcctttcc ctctctctct 87900 

ccccaatgct tctcagcttt attctcttca 87960 

gcacacaaca tacatgtgac ccctccaact 88020 

tgggtccagt tccaaattcc caggagaaag 88080 

cacccccaga tcaatcagct ttgccagggc 88140 

cactcaacaa actcaaggca tgcagattaa 88200 

accatccaga ctgtgaacaa aaggtctaca 88260 

cttgcaaatc caaatccgta aaaggaaaac 88320 

ttattttgtc tggtaccatg ctaggtgatt 88380 

agaatgagat aagatgcatt attcattttg 88440 

ttttgcagca gttgtacacc atgaccgaac 88500 

ttattgcccc agataatgtt cacgtcatcc 88560 

atctgcagcc ttcagaggtg accacttacc 88620 

caggaagagc aaacttacag aatctactcg 88680 

agagtcaacc cattgggtat cagcgacagt 88740 

tgcctggagt gatccaatac acacagggaa 88800 

ttcagaatcc tctgaatgct aaccctgggc 88860 
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tgacacacac ctcaaactca tcccagtgga 
atcccaagaa attcataaat gaggaggtca 
5 ctctccgatt tgggtcactt cataagccct 
gtgttgagag ggtaaagggg aggtggagtg 
ccagtacaat cacccagatg ctggcccctc 

10 

acccatggtt tgggggataa aggtttgaat 
atctatcagg ggcatgggtc agcaatagtt 
15 ggcaatccag gtctcaggcc agctagattt 
gctctgggcc tccatttcct catctttaga 
ccagctccaa ggttttgtga tttccttttc 

20 

tcctccagtt tcatttagca ccttccaaag 
ccagatgcct ggctggatct ggatgcctgg 
25 cagatgccca tactctatgc actcatccat 
gagactcaga gataaccaaa acatcatcca 
agctgtcata attacaatac cttctggcaa 

30 

ctgcccaaag tgcaaattgt ctggcttctt 
ggaagacaag gacaggtgca ggaaggggac 
35 aagtagattg tgccaggaac aaggcgttgt 
tgaactaaat tctataaaac agcaatcagg 
ggctcatgcc tgtaatccca gcactttggg 

40 

aaatcgagac catcctggct aacatggtga 
tagccaggtg tggtggcggg tgcctgtagt 
45 atggcgtgaa cccgggaggc agagcttgca 
cctgggctac agagctagac tcagtcttta 
tcacaagcat caagtgagat tataaatgtg 

50 

ctatgatatt gcgtagtaat tatcatcttg 
atactgggta aaggtttgtt aactttggct 
55 ttcaaaattt gtcaaaacag aattaagtca 
aaatgaaaag ccaaaaagcc aaaactgcag 
gggaaggaaa ctatgagtgc ctcttagccc 

60 

gcagcacttg cagctgcagt atttagcccg 
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acacgtcatt tgcatcattt acttcattta 88920 

gaacattagg ggtgagtgtg atttctcctt 88980 

cgtctagtgc tctgggatga ggaagggaat 89040 

ggtgtggggg gcagtttctg gaggctggac 89100 

tcccttgtct tccttccact ccccaaggag 89160 

aatcagtgaa catccccacc ttacccagct 89220 

ccactaaaaa gaccactgag cagggagtca 89280 

gctgagtggc cttgcagaca tctcttcctc 89340 

ataagaagat tcagctttct taaattcctt 89400 

aatcaagggg gcctctgcct ctttcctgcc 89460 

ctcaattcaa ttctgttttt gaggcagaag 89520 

ccaactctta gtatggagac tggcgtgtca 89580 

tcaacactgc acgtgccctg caaggagctg 89640 

tgaagatgaa ggagggtctt tgttcatggc 89700 

ttgccatgag acatgcataa gttattgtgg 89760 

tgagaagtta caaactgcct cctggaggca 89820 

ccattcctcc ttatttaatc ctgggtagac 89880 

ttgcctccca aacaaatcaa aatcaaatcc 8994 0 

aatagaaatc ctggtcaggc cggacacggt 90000 

aggccaaggc aggcagatta caaggtcagg 90060 

aaccctgtct ctaccaaaaa tacaaaaaat 90120 

cccagctact ctggaggctg aggcaggaga 90180 

gtgagccgag acagcgccac tgcactccag 90240 

aaaaaaaaga aaaaaagagg gaaatcctgg 903 00 

tgcggaaatc ctccaaaagg taaagctggt 90360 

tattacaatg actatcatta ttattaatta 90420 

agcaattatt ctaagatatt ttggaaggat 90480 

gagcaaatat ctggggaacc tggccaaaca 90540 

actatcttta atcaattaca attttgtttg 90600 

ttttcccgat tgccccaaga atattcgcca 90660 

agataacttt gccaatgtct cccttttatt 90720 
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ataactttcg ccttgctcta gcatattgac 
agcattctgt ttttagtagt agtattccca 

5 

aaaatgtcaa atcctagaaa acgtagcatt 
cagttgttgg ccaaagattc atttgatgaa 
10 gttagttctg tttagaaatc actcgaaaaa 
atcagccaga tttgcttcag cctcaagagt 
agcgagcggc actatttttt ttctaaatgg 

15 

gggcctgtag acagagtcaa gcatgtcgcc 
tctgtcacca actcatagac tcatggacca 
20 tcatttgatc cctgaggaaa ctgaggccca 
ccaaatgaga ggggattcat cctggacggg 
agggaaggat aggatttcag gaggaggaga 

25 

cagtggagga aaagcaggga gtaaggggaa 
tccccttgtg tacgttgcca aggggaactg 
30 aatatcggta ctgggaagat gaatggagtc 
caggcctggc gtggtggctc acccctgtaa 
gttcacgagg tcaggagatc gagaccatcc 

35 

aaaatacaaa aaaaaattag cctggctggg 
gaggctgagg caggagaatc acgtgaacct 
40 acgccattgc actccagcct gggcgacagg 
aaaaaaagaa acacctactg catgctagga 
atctctacca cagccctgca agctggatac 

45 

ctcaaacatg ctcttgaggc tgaagcaaat 
taaagacctt tctttgagtt atttctaaac 
50 gaatcatcta ttttggaaaa atcagattca 
gaacgatgtt aacatcacgt gtaggaatgc 
tccctgcaaa taaggaaacc ctgaggcaga 

55 

ccagtgctgc agccaggaca ggagccaggt 
tgtctctgca cacttgtggg ctcagtctaa 
60 tgtcttacct ttcctttatg cctgtgattc 
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cttggaagca agagacatca 


ccctcictcat 




tttacaaaat acagtaattc 


tcgattacta 


y uo4 u 


tctacgcatg gtgttaacat 


tgttctcaaa 


y uy uo 


tccagtgttt ccgaagtaaa 


caattctaat 


QAQf ft 

y u you 


ccgtttttat catttatttt 


cacaatgaaa 




gtgtttatgt aaaattaaat 


gagcgccggc 


y x u o u 


gaacagggtt cattaatata 


ctgataaagc 


y 114 u 


ttaggaaggg atgttcatgt 


gaaacccagg 


y iz uu 


ggaagacaga aagcagctgc 


cctaggctag 


Q1 Af A 

912 6 0 


gagaagccag tggtaaatgc 


tgtaggagcc 


91320 


ggctcaggaa aagaatctct 


ctggtcctta 


91380 


gtaaagcaag cagatttcag 


acagtggcaa 


y 144 U 


aaacacagga cttgcagggg 


acagaggatc 


A 1 C A A 


tggggacgtc atagaggagg 


ccacaaagag 


91560 


agcagagggc attgaaatac 


ttactgcatt 


9162 0 


tcccagcact ttaagaggcc 


gaggcgggcg 


m c o a 

yioou 


tggctaacat ggtaaacccc 


atctctacta 


y l /4 u 


tggcaggcac atgtattccc 


agctactcgg 


91800 


gggaggcaca gcttgcagtg agccaagatc 


Q 1 O ^ ft 

y loou 


gcaagactcc gtctcaaaaa 


aaaaaaaaaa 


yiyzo 


actgggctac aagtttcata 


ggcactttta 


A 1 A O A 


tgttaccccc atcttgcaaa 


tatgtagcct 


Ol AvI A 

9Z04U 


ctggctgatt ttcagagtga 


aaataaaata 


QO 1 ft ft 

yz iuu 


agaactaaca tcgaatggtc tgaatcatca 




tcacatgaat ttcggccgac aactatctga 


QO lift 

yz z z u 


tacgttttct aggatttgac 


atttttggtt 




gaggatattt gctggagacc 


acacagccgg 


92340 


ggttttcagc cacaacccgg 


gctcggcccg 


92400 


agggaacaca tttcttgctc 


tgttataggc 


92460 


tcagcctgag tgcaaattcg catcacctgt 


92520 
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gaagttttta aaaaaacacc caggtgtgga 
tctgggatga gcccccaaca ccaatcaccc 
5 caaagcacag ctccacagcc cacaggcccg 
ccctgcaggc catccagatc ctcatcggcc 
tgctgtatta ctatcctttt gtgacctggt 

10 

ccgtgagtac aaggccatat ggtctccttc 
tggaggaaga aggagcatgg aatcccattc 
15 gcctccaact tgcagataca gcaaagggtt 
tcccaccacc accctcccca tctcccatac 
tacaaacact cattatcatg ttctttacct 

20 

gatatttatc tgataaggcc aggcagggtg 
ggccgaggta ggtgaatcac ttggggtcag 
25 agacctcccc ccacccgcca tctctacaaa 
tgcacttgta gtcccagcta cttgggatgc 
gtggaggctt cagtgagcca tgatcacacc 

30 

agcctgtctc aaaggaaaaa aaaagaaata 
gtgtgctata gacagcatct acctaccata 
35 aggaattatg taaaatgatt gtcaaaccat 
tacatacgac atggaaatca acacgtgctc 
acctcataag gcagatgtta atcatttacc 

40 

gagcttaaat aatattaata aattgaacac 
cctagggtgg tcaaattcat aaagatagaa 
45 gtaagagggt aatggggagt tagtgtttta 
aaggagatct tggggtggct ggttgcacaa 
tttttttttt ttttgagggg gagtctcgct 

50 

atctcagctc actgtaacct ctgcctcctg 
cgagtagctg ggattacagg catgtgccac 
55 agagacgggg ttttgccatg ttggccagcc 
gccccccttg gcttcctaaa gcgctgggat 
caatatgaat tttcttggtg ctagtaaacc 

60 

ttaagttgtg tgtattttac cacgattaaa 
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ccctaccctc agagatttgt atccatttgg 92580 
ctggggtgat tctggggcac agccagggtg 92640 
cagttctgct ctctcacagt gacttgttct 92700 
tgacgcacat tttctctgca attaaccctg 92760 
tgtcagggta cccgctctgg ggaggattat 92820 
ctgggtcacc agactgggac aagaaggaag 92 880 
ccagtaatat gactctgtag actagagcca 92 940 
atatcaactc cctgccccag caacactccc 93000 
cccccaccca gcctctaggg aactattggg 93 060 
cttccaccct ctctgggttc tcttttatga 93120 
gctcacgcag gtaatcccag caatttggga 93180 
gagtttgaga ccagcctggg caacatggca 93240 
aatacaaaaa attagctggg tgtggtggtg 93300 
tgaggtggaa ggatcacctg agcccaggag 93360 
actgcactcc agcctgggta acagagtgag 93420 
tctcatatgt aactcagagt acgtaaagtt 93480 
ccagttatgg gagccgactg caaaatttcc 93540 
ttggtagcag gaagttggcc atgttgggta 93600 
caaatcaggg cccttccctg cagtgctgcc 93 660 
agcaggccat tcagtatatt tcatcttgca 93720 
ccacatgtac aattccactt agatgaggtc 93780 
aaagcagcat ggtgattgcc aggggctggg 93840 
tggggactga gtttcagttt gggatgataa 93900 
caatatgaat tttcttttct tttcttttct 93960 
ctgttgccca ggctggagtg cagtagtgcc 94020 
ggttcaagaa attctcctgt gtcggcctcc 94080 
catgcccggc taattttttg tatttttagt 94140 
tggtctcgaa ctcctgacct caggtgatct 94200 
tacaggcatg agccatccca cccagcctaa 94260 
aaacacttga aggggtttga atggtatgtt 94320 
aatagtaaat taaattaaag ttttaaaaaa 94380 



-327- 



WO 02/062946 



PCT/US01/48437 



tgaacaccta cgtgtccacc acctaactta aaaaacaaag catcacacgt ggtttttaaa 94440 

tttttaaatc cctataagta tcccctctat gccccgagag gtaaccacta ttttcaattt 94500 

5 

aatgtttttc tttttttttt ttttttgaga cggagtctcg ctctgtcgcc caggctggag 94560 

tgcagtggcg ccatctcggc tcactgcaag ctccgcctcc cgggttcacg ccattctcct 94 620 

10 gcctcagcct ccagagtagc tgggactaca ggcgcctgcc accgtgcccg gctaattttt 94680 

tttttttttt tttttttgga tttttagtag agacagggtt tcaccggggg ctcgatctcc 94740 

tgaccttgtg atccacccgc ctcggcctcc caaagtgctg ggattacaag cgtgagccac 94800 

ggggggtggc cgatttaatg gttttcattc ccttgctttt tttttttttt ttttttttga 94860 

aaaggggtca ctttgtcacc caagctgaaa cacaggggca caattttggg tcactttaac 94 920 

20 ctctgcctcc taaggtcaag cgaatggccc gccccaaccc cctgaataac tgggattaca 94980 

ggggggcccc acccctccca actaattttt gttttttcaa taaaaacagg ggttcaccat 95040 

gttaaccagg gtgggctaaa actcctgggc tcataatcca cccacctcgg cctcccaaag 95100 

tgctgggat 9510 9 



15 



25 



30 



<210> 81 
<211> 19303 



<212> DNA 
35 <213> homo sapiens 



40 



<220> 

<221> genomic_DNA 

<222> (1) . . (19303) 

45 <223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 

<220> 

50 <221> coding_region 

<222> (8830) . . (9012) 

55 

<220> 

<22l> coding_region 

60 <222> (9468) (9587) 
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<220> 

<221> coding_region 

<222> (10759) . . (10815) 

<220> 

<2 2 1 > coding_region 

<222> (12466) . . (12597) 

<220> 

<221> coding_region 

<222> (14049) . . (14159) 

<220> 

<221> coding_region 

<222> (16890) . . (17009) 



<400> 81 
atttttctaa 


agtacaaagt 


catcatgtta 


ctgttctgcc 


taaacttatt 


caggctctcc 


60 


tttgcccttg 


ggctaaagtc 


caaatgctct 


agcaacactt 


acaaagccct ggagatgtag 


120 


cagttcacct 


ctccagcccc 


atccctcctc 


accctccact 


tacacttgac 


cccgagccat 


180 


cctgagtctc 


cgtgagctcc 


atctgcctgg 


aaggctgctc 


tctgcctgct 


tccaaccctc 


240 


ctatttgtcc 


tgcttragctc 


ccaaccccac 


ttcagacctt 


catttagata 


ccacttcctc 


300 


caggaagcct 


tatctggcct 


catcccccaa ccccttctag 


attaggggaa 


ctcctccagg 


360 


ctctcgccac 


acccacatct 


cagtatcaca 


tcactcaccc 


gtccacagca 


cagttacctg 


420 


tctccctgtt 


catcttattc 


aatggacttc 


atattctttc 


cctattgctg 


ctgtaacaaa 


480 


ttaccatgaa 


cttagtggcc 


taacataaca 


cagacttatt 


attagacagt 


tctgaggtca 


540 


gaggtcctaa 


aatccaggtg 


tcggcagggc 


tgagttcctt 


ctggaggctc 


cagggaggcc 


600 


ccgcttcctt 


gccttttcca gcttctagag gccacctgcc 


ttgcttggct 


tgggaccttg 


660 


gcctccacct 


gccaagctct 


tccactcact ctgaccctct 


tgcctcccgc 


ttgggaatac 


720 


attgggccca 


ctgaataatc 


tatccctatc 


aacaactttg 


acttaatcat 


ctgcaaagcc 


780 
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cttttcactg cataaggtaa cgtattcaca 
ccattgctct gactaccaca aattctgagc 
5 ctcctggatc cccagcatgt cacacagcgc 
tatcaatgaa cggatcagtt aatgaacaag 
actcagcaaa aaagcactgg tcctggaaaa 

10 

cgtgggtgat gaacgggtgc aaggcaacca 
agcattacag tgggcaggca agtgccttca 
15 ctgctgctca gcctggcccc ctacccctca 
gaggggtgag tggccttttg cctgattgtg 
acatgcatga cagctgagca cgagcgaccc 

20 

cacggccact gacacgtgtg aggcgctgct 
aaccctccag caacttgatg ggtactatta 
25 ccaccaaagt caaataactt gcccaaggtc 
gaactcaggc atcccagatg cagagtcagc 
aaagtcagtc agtcctggca tcagggacta 

30 

tccctgtcca ccctcagcag agcgatgtgg 
caaggcggga tctaggggtc gctggatctg 
35 tgggggttgc tggatctggg agttgtggga 
gatctggggg ttgctggatc tgggagttgt 
gtgggatctg gaggttgctg gatctggggt 

40 

ggttgctgga tctgggggtc actggatcgg 
tgggagtcac aggatctggg ggtcacggga 
45 gatctggggg tcacaggatc tgagggttgt 
gtggtacctg ggggtcgctg gatctggggt 
gtcactggag cccactccac acctcctcca 

50 

ttcccagagc agcaccgtgc tcatggaaag 
gtcagatgca atttgtgggc catgcctgga 
55 taacaagaca tttgtgagat aatcagaaac 
taagtaccaa aaactagata agaacaggtt 
cacaacaggg gtccccaacc cccaggccat 

60 

accaggcctc acagcagaag gtgaggggca 
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gattctaggg atgtggacat tgttgaggtg 840 

tcctcaaggc caggggcagc ctcttgccca 900 

ctggcccaag taagtgctca ttagctgttt 960 

ggtatcaggg gggtgggggg ctgtcttcct 1020 

acactcaccc catgtttgag gagctaaagt 1080 

atgacagtga ttattggcct cacgtgtctc 1140 

gaagaaacag gtctctctca aagctactgg 1200 

ggggaagcct gcatagtcca gcaggtctga 1260 

tgtgtgacat cgaagtctat gccacattga 1320 

cggaagcaca cagcaactaa aataattagg 1380 

tttaagcatt ttacatattc taactcattt 1440 

tccccatttt atagagggga aaactgaggc 1500 

acacaacatt aagaggtaga atcaagattt 1560 

tctgaagtgc tatgctgtgc tcagcaagtc 1620 

tctgtacagc cgctggctct gcctcgcgct 1680 

agagagaagc cagatggaaa aggaagagcc 1740 

ggggtcgctg gatctggggg tcacaggatc 1800 

tctgagggtt gttggatctg ggggttgtgg 1860 

gggatctggg ggttgctgga tctgggagtt 1920 

ttgttggatc tgggggtcgt gggatctggg 1980 

gggggtcact ggatctgggg tcataggatc 204 0 

tctgggggtt gtgggatctg ggggtcatgg 2100 

gggatctggg ggtccctgga tctgggggtt 2160 

cactggatct gggggtcatg ggatctgggg 2220 

ccagaacagc ctcacggaag accttcacac 22 80 

atcagggcac aagatgtggc aactgattct 234 0 

tcctggatcc tgatttgaac aaatcaaaag 2400 

tgaagctccg actagttatt tgatacccgg 2460 

tttctgactc ctagggttaa gcttgtcccc 2520 

ggagctatac cagtctgtgg cctgttagga 2580 

acaggcaagc atgacctacc acctgagtgc 2640 
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15 



25 



cgcctcctgt cagatcagcc gcagcaatag attctcaaag cagaacgggc cctattatga 2700 

attgcacatg caagggatct aagtggtacc ctcatgagaa tcgaatgcct gatgatctga 2760 

5 

ggtggactaa tttcatccca gaaccatccc ccatccccac cccggcccat ggaaaaattg 2820 

tcttccacga aactggtccc cagtgccaga acggttgggg accactgctc tatgaaagta 2880 

10 gtttcttttt tcctttcctc tttttttttt tttttttttt ttgagacagg gtcacactgt 2940 

gttgcccgga ctgaggtgca gtggtgtgat cacagctcac tggactcaag tgatgctccc 3000 

acctcagcct ctcgagcagc tggaaccata gacagggacc aagacacctg gctcattttt 3060 

gaatttttga atttttgtag agatgaggtc tcgctatgtt gcccaggctg gtctcaagct 312 0 

cttggctcaa gtgatcctcc cactttggtc tcccaaagag ctgggattgc aggcatgagc 3180 

20 caccgcccgg ccctttaaaa gactagtttc caaatgttca acagagctct gtggggctgc 324 0 

agggttaaac tctgagcgta aatagacctc agtgagcacc aggactcggt tttaattttt 3300 

ttgttctgat catgtggtca cacttttttt ttaaagagtc catatctttt agagataaat 3360 

actaaaatgc ttacaaataa aatgggaaga tatctgagtt ttgcttactt ctgggggaga 3420 

ataagattgg ccaagagctg ctgatggtgg ctgtagctcg cgaatgggca caggggtgtc 3480 

30 atggaaccat tctctctact tttgtgagtg cttgaaattt tccagaatta aaagaactct 3540 

ttcaaaaggt tcaacttctt actccactcc caggatctaa gacgattaca accactttca 3600 

ctccatcagt catgccctga aaacagatta aattatcaac atttcccttg ccaaaggaaa 3 660 

tttacaactg tcaaggagag acataaagaa gggcactaag attttcacct gccggggttt 3720 

ggcgggaggg ggaaggactt tgacagcttc catgcaaccg gggcatgaag cagggcgtgg 3780 

40 gtccaggagg gagaagatcc aaagtcgaat gttgtaaaga tgtgctgatt tcccagcgct 3 840 

gtctctgggg agctccggca gcgcaagagg gcaaagcaca gctggaagct cagagctgca 3 900 

gtcccaggtc ctggggtgag tggtcccccc atggcaggaa cagaggggag tgattttgtg 3 960 

tttccatgtg gagttggggg gttgctcagg tgtacagacc tggtcccccc ttgaagccag 4020 

gggaccggcc tcttataccc ctagccaatc agttgttggc tgtagggttg gagcggtgtt 4 080 

50 acttaaactc ccagaaggga ggagatgtga gcctttagca atgaacaatt ataccagctg 4140 

gagaatgggt ggcccaccca gcagagggga tctgggaagt agggggcacc gtcggcctct 4200 

cccctgcgga gccccatcct gctgccagtt actgaccctt attgctggag gagccccaga 4260 

cacttttctt gcagtctgga aatgcataca cacagagaga ccagttcaag aaaacagagc 4320 

cccagggagg agaagggact tgtcgagggc cacgaggggg caggactaga atccagattc 4380 

60 ttgactccct ccttggacgg aggcctctgt ttcctgcagg ccttccagct gctggtgggt 4440 



35 



45 



55 
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gagctctctt tacaccaggc acactgtctg 
gtcgtaggtg gttttctaac tggcattcac 
5 cccaacacca gcctgcctgt cacatgttgt 
cctgcctttc tgtccttgac atgcatgcac 
acgcacacac acacacacac acacaccctg 

10 

tgcacagtct gatgagggtc atattattca 
gacctcccat ggacacatct tactcactcc 
15 aactccacaa acaaatccac gaatgacagt 
tcagtgcctt tcttccctca cgacccccta 
gccctctccc cctgtaagca ggatggggat 

20 

agggacaggg tgtatcatgt cttaaccaag 
ctcacagccc tctccccctg taagcaggat 
25 gatgataggg atggggtgta ccatgtctta 
ctcactggtg ggattctaga cacgcaacag 
ctatacgcca gtccctgtca gttgcaggca 

30 

ctagggacag gcagtggcac tccccatgag 
taaaaaaccg gcccaaggtc aaagctccgg 
35 ccattctagc tgacaacttc cagcgctcag 
ggttctggag ggtcctgccc agctctgtcc 
aacaacgaga atatgtgtgg gacccaggca 

40 

ctagtggcag tgggtttccc gtgggatacc 
ctgagggggc tggggttggg cggggaaagg 
45 ctgctgcctg ccccctctct gtcagcacac 
tggaagtggg ccaggacagc tgggaataag 
gtcttggaaa ggcgtacgag agagtcataa 

50 

ccctctgcaa ctccttcctc tcctggttgt 
gtgagagaag tattatattt aaagttggaa 
55 ttcccactgt gatatgcacg tgagcaaacc 
ccaggaccag aaattgaaac cttcaaggcc 
ccaattgcat agaaccgtta gacaatgaga 

60 

gtggctcaca tctgtaatcc tagcactttc 
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gaagcaccag gaggaggctg gttcagtaca 4 500 

tgaaacgctg atcctctaag gcaccccacc 4 560 

gacactcttg tcactcagca cctctcaatt 4620 

atgcatgcac acacatgcac acacacacac 4680 

agaagctctc tctgtcctct agggctccac 4740 

tgccacgttt cctctgcctg ctcagccaca 4800 

tgcctcggtc cctgcagtgt ccagtgcaga 4860 

gttggatctc agatcactag ctcaatacac 4920 

cccccacccc agggccccag agctctcacg 4980 

ggaggcaata aactcaccac tccagatgat 5040 

aaccagacga ggacaaggtg gagcctcgct 5100 

gcagatggag gcaataaact caccactcca 5160 

accgagaacc agacgaggac aaggtggagc 5220 

gccagtggca actgtggctg acatttattg 5280 

ttgtttcagc tactcctcac cacctacccc 5340 

aaggtgaggg gagactcaga tgctgagatg 54 00 

ggagcagagc caggactttg cacggagcac 5460 

ggccgtgccc actctgctgc tttgctgggg 5520 

ctgtcctccc tccactccca gaactgaagg 5580 

tcccgtggcc ctgtgcagag cgctgaggcc 5640 

aagctggctg ggtccgtggt cctgagctgg 5700 

ctaaaaaagt tcctttatgc accaaacagc 5760 

tgtcacctcc tagcagccct gcacaggtcc 5820 

tgttctaagg aatcagagtt catgctcttt 5880 

caaacttttc cccatgatga ctaaggtctg 5940 

ataaaattcc ttgtagcggc tctctatccc 6000 

gggaccttag agatcatcac tcaattccca 6060 

aaaagcagag aaaaggccaa aaaccagagc 612 0 

actttgccca aaagcagacc ccacagaata 6180 

aaaaaaagat cctatggcca gccgggcgca 624 0 

ggaggccaag gtaggcggat cacctgaggt 6300 
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cgggagtttg agaccagcct gaccaacatg 
attagcaggg cgtggtggca ggtgcctgta 

5 

gaatcacttg aacccgggag gcggaggttg 
agcctgggtg acaagagcga aactccgtct 
10 aataaataat aaagattata tggccaatta 
gagtgtgcca cttgtagaaa atctcagagt 
aaccatggtt tctcaaactt atttgaccat 

15 

caatgttcca tggaacatac tttgagaaag 
gttggatatt tgaaccattc ctactgccac 
20 tacagctcca gtgcccaggc actcactatc 
ctctactaag aagtccctct tcttctgttc 
tattgtcctc ccaggtgtgc agagtctagg 

25 

ctgccaccga atacatccct tagtccagct 
ggaaccttcc caagggacag ttcccagagg 
30 acaaaaagaa tcatacagca ctgtgtctat 
gagaccctct gcccactcac cccactaaat 
atcaatcagg gggtcctacc cagcagccca 

35 

atgcgtggtt ccaattccag gcacttcctc 
tctctgaacc ttggtttcct cttctatgca 
40 gttgccaggc aaaactgcct cctggcacag 
aggccccatc tggctctttg gctcttgata 
aggatatagt cacctgggct atttgtccct 

45 

cctgtggaaa atgtggatgc tcaatggaac 
gagaggaatg gccctccctt ttcattcatc 
50 ctcaaggatt tgctcattag ccccatcccc 
gcaagccagc tccctttctt ccccgcccag 
ttggattttc tcaccaagtc ccaggcagcc 

55 

tatacactat accaggccct gtctgcaggt 
cccaactcct ttttgtgccc atgagaacat 
60 acagtgcatg aaggacttaa gccccaggct 
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aagaaacccc 


atctctacta 


aaaatacaaa 


6360 


atcccagtta 


ctcaggaggc 


tgaggcagga 


6420 


cggtgagcca 


agattacgcc 


actacactcc 


6480 


caaaaataga 


aataaataaa 


taaataaata 


6540 


agtttgggaa 


atgcccttaa 


tactaaaaca 


6600 


ctttaatatg 


agaaaaaggg 


agaagaggct 


6660 


ggaacccttt 


tactgcagat 


acatcaggac 


6720 


gttattctgg 


attcttgagt 


ccaactccct 


6780 


agctctcaca 


agtgaatgtc 


taggtggagg 


6840 


tccgtggaca 


agcagttcca 


tccaaggcag 


6900 


tttgcagtgg 


agtccaggcc 


tgggtccagg 


6960 


tgaagccagt 


ggctaacctg 


gttcaggcct 


7020 


agctgtgtca 


gacggacttc 


tgaaggcttg 


7080 


gacagttccc 


atgcatagct 


gggcagagtc 


7140 


gagctcagct 


tctgaagccc 


aggaaagagg 


7200 


cagcggtcgt 


cactgcagga 


ccatggagat 


7260 


ggcccagagc 


actggcgatg 


gagtaagacc 


7320 


actgtgtgac 


catggccaag 


gcaggcaacc 


7380 


atggtgctga 


cttcctgagt 


attaagggag 


7440 


tgcccggcat 


acagtgctca 


cccaactgtg 


7500 


atgtcccttc 


ctttcccctc 


tacccaatcc 


7560 


ccaaacttcc 


caaaactctt 


tccctgcatc 


7620 


ttcctggatc 


cacaagtcag 


acttaaaagt 


7680 


caggatacag 


tctgagagca 


gcctctctct 


7740 


atggagaaac 


catctctgac 


ctcaatccct 


7800 


ctgtccactt 


cctgggacag 


aaagactacg 


7860 


cccaagccct 


agtgcatccc 


acggcgtcag 


7920 


cgtgtgtcca 


gcccccttct 


aacttttttc 


7980 


ccagaacatt 


catttccaat 


tgttgagcca 


8040 


ctggagccag 


cctgccatgc 


ttcaagtcca 


8100 
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agtcacttac tggttgtgtg accatgggca 
aaaatgagag tgaggacggt atttacctca 
5 tgcacgtaga aggtttatct cagtgcctgg 
cgttaaatat ggacttcctg actcagtgtt 
tttcttccct tctctccagg aggcatgggc 

10 

cacctgcctt cagtcactct tcttggacca 
attgacctgc acctctcgca cctcccaact 
15 gagctctgtc ttccactggt tgtgaagttt 
agctgtgtct ttgcaacccc agaaaggctg 
cagattattc caggtcaaat caagaccaag 

20 

ttctggaaag atctggtccc ttaaacagag 
cctcagaaat gagacgggtt ctgacggcct 
25 ccagcatcaa tgaaagcaga agccacagtt 
tcatggcaag tcctcagccc agtccagccc 
cagcccaagc tcctggctcc acaccagcac 

30 

aaggagctgg gggtgagcat ccacttccca 
gggatatgag gaggatgctg gggggcccca 
35 gctttcctgc tgaacattca tctgaactgg 
ccacacctac tactgccttg tgaggtttgg 
aggctgccag ctccaggata gatagctccc 

40 

ttctgcagga aatcctgggg agtggtcctt 
gaggacagaa gggactcacc acagccagta 
45 ccagtgaccg atgccctcac cccagccatt 
ccatcgctct gctgcacctg gtctttgggg 
acctggtggt gctgaagtct tggtatccat 

50 

aaaacacaga ccgggtgtgg gagtctgcaa 
caggatgcag ggtgggatag agagagctaa 
55 ccagtgttcc tagctctgct tcttttatct 
agcccctgtc aattctgagt cccaagagag 
gaatatgtgc atttggcaaa gaggagggtg 

60 

ccagccttct gagcagccag gctgcctctg 
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agtggcttaa 


cctttctgag 


cctcatctgc 


8160 


tagggttgga 


atgaggacaa 


aatcaggtaa 


8220 


agcagagaaa 


gtatttgtaa 


atgttagcat 


8280 


ggctcataaa 


aagtcctcgg 


tggaagatat 


8340 


accaccacag 


gaggtaggac 


aattatttcc 


8400 


gtggatggag 


actgtttaca 


aagcccctcc 


8460 


cttccaaact 


gcagtctggc 


cactttgggt 


8520 


gcctggtcct 


catttttctg 


ccccaaaagg 


8580 


tgagcagcaa 


ttactcccat 


ctccaggggc 


8640 


accaggggct 


gccagacagg 


gcaaccacag 


8700 


gtgattggca 


gaggctcacc 


caggcacaag 


8760 


tcctccccgt 


cctgcagcca 


gggcccccat 


8820 


attcccagcc 


gttgtgctag 


ggggctccca 


8880 


tggcagacaa 


gtgcacccca 


gaacacgacc 


8940 


gagaagtccc 


agaagaagag 


cagccttctt 


9000 


gggtcccagc 


agtgggaggc 


agaggagagg 


9060 


agggaagaaa 


gggaaaagag 


aaaggctcca 


9120 


aaaaggcctt 


gaagattgga 


gagaatctgc 


9180 


acagggccct 


gtcctgggga 


gaatggcaga 


9240 


acccatgggg 


cccttggcag 


cttcagctct 


9300 


cttagagagc 


ctggtctctg 


ggatccctaa 


9360 


ggtccagggc 


actagtggcc 


tcaattagag 


9420 


ctctcttccc 


cacccaggcc 


ttccacatca 


9480 


gctacctggc 


ctctatagtc 


aagaaccttc 


9540 


tctggggggc 


tgcctctgtg 


agtagaaggc 


9600 


ggcaggcctg 


ggaggccctg 


gcatttggga 


9660 


taggagtgcg 


gcagaagcga 


ggccctttct 


9720 


tagtcacaat 


ggaaatagct 


ccttccaggg 


9780 


cacttggagg 


ctatccattc 


tcccataatt 


9840 


agtaagaccc 


agggacagct 


tcattcttcc 


9900 


gatccagcta 


caggtgccct 


gttggggttg 


9960 
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agctacaaaa acacacaggg tgtaaggtgt 
gatgggagat gacactggac cagtgcatgc 

5 

ggtctccagg gagcccaccc cattcctgac 
tccaagctcc acggcactgt cttgggcaag 
10 tgttctcact gcccacagga tggggctttc 
cagtgcttat tcaccctgtg ctcccaggga 
ggaaagaatg attgcagagg tgctcccaga 

15 

aactgcaaga ttcaagaaaa gagaaacagt 
agcgatggtt tgggtttgta aaatgaggga 
20 gcagggcaca gcttggtcca aggtgtagag 
gggaagatgt gaggtggggc agcttcttgg 
ctttgctgga ggaatagtat tccctgcccc 

25 

aataggggtg ggggtgggaa tttgtgggtt 
ttcaaatctg tcctgcagtt tctcatttca 
30 tctaaaactt acctggtgag tgggcacact 
taactttgtt aagggcatct gtctgtaggg 
ctaagagcag gctcttgcct tcaggacagc 

35 

cagacaggct gacagacttg gctgtgggag 
ttccagtgca gtgaggcaga gaaggaaatg 
40 gacacaactt tgcttcctct caaatctact 
caagccccag cctccagctg gatttgcatc 
gaacggtggt cctccctttt ctgcccgtgc 

45 

cctgcacaca tggctcccat ctttgactta 
ttgtcctccc ggagatcttc cctgacctcc 
50 tggacttacc tgtctctact gtacactaaa 
gtgatagccc aaggtggccc aagtaggcac 
gtggaggtgg atggatgatg gaaggaacat 

55 

tcccacctcc cctccttctg cccccgaagc 
tcagcctggc taggctgtgg ttgctcccta 
60 tggttgtgca gtttacccag cagaggcagg 
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ggctcctgtc ctcccaaggc catcagccag 10020 

accatttccc agaagctcta cattccttca 10080 

tcccaggtgg agtcctgcct atgcactaag 10140 

tatcccaatc cccctttcac atcagtccac 10200 

actgcacagt gatagaactt tccacacatg 10260 

ctagcccata gtaggtgctc agtgcacatt 10320 

gaccttggaa gcaagaactc atatcctccc 10380 

cggggtggag gggatagtgc cactgatccc 10440 

agcaggaggg tctaggaggg agttctaagt 10500 

atgggaggag gtatggaggt ttgtgcacat 10560 

atgggccaca tggcaggcaa gaggagctca 10620 

actggccgcc aagggttgat tttcttggag 10680 

ttgaggaacc ccagcttctc tcctttgact 10740 

gggatcttgg cgataacaat gaagaccttt 10800 

gagatcattc tcttccatct gagcatggca 10860 

gtgattgtga gggcacttct ccaggacttg 10920 

agaggatgtc caagcagaag ggtccagatg 10980 

gaaaggaagc accatctgaa gcttctctct 11040 

gctcccaggc accctgaccc atccatctag 11100 

gtgtcttaat cttaatcacc acatagcctg 11160 

tacttctcct ctctaagagc caggcagatg 11220 

tggccactgc ccaaacacct tgccctctcc 11280 

ctcctgacca ctcaagacaa cctgtatgtc 11340 

atggctgcct tccattctcc cacaactctg 11400 

ttatttctgg gcagaatgtg cccgagcata 11460 

agagtagaca cctagcacaa gactggatgg 11520 

ccctgcaaat cacagctcac acccaggtct 11580 

ccaagccagg tccttcccta tttgtgtccc 11640 

ctttctcaga gggcatacag aatagaagga 11700 

agatgcttga caaagacgtg cccattcccc 11760 
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agacctcacc ctcctgggct gagccccttt 
tgagcataag gacactctta ctgtaaagaa 
5 catatgtgca tagtctttcc tcactaggag 
gatgcttaac ccgagaggac gatgaataca 
agaaaatggg ctggtactgg gacatgtgag 

10 

tttctggtgg ggaaggggct ggatgagggg 
catagctgat gtaggatggt cggggctggc 
15 ttattgcaaa gctcttggat ctgaaggcag 
aggcaagggc cagctggacc ctcctcagtg 
tgtgaggggc tgaggggcca gtgacaggca 

20 

gtctctcctg aggctacaga ggaagctact 
gtctccaccc tttggacagc tgttaccttt 
25 atgacaaacc tcatcagcct cttttgcgtg 
ctctttctgg agagcccatt tgagtccccg 
agtacccagg cctggagggc cctctgactt 

30 

caggaggttt ccagctcaca gaaagcccct 
tttgccaacc tgtttctgtc ccctacaagg 
35 actggtcagc atcactgaat ccaagtccac 
ggggaggagc ctgcctgagg tcaagcagca 
aaactgctga tttcccagcc agggctctaa 

40 

gggagcctcc catcagcctt gcaaagctgg 
ggagtgaaca cggaggtcca gagaggctag 
45 ctcttttcac catatcactg ctgcctctga 
gccaggctcc aagaattctt ttttcttttt 
ctcttgtcac ccaggctgga gtgcagtggc 

50 

ccagattcaa gcaattctcc tgcctcagcc 
cacagccacc accacgccag gctaattttt 
55 gtcggccagg caggtctcga actcctgacc 
agtgctggga ttacaggcgt gagccaccat 
ttattcctta tctcctccca gcacatacac 

60 

tgtgcgagca tgcatggaca cacatgtgcg 
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tagaaccaca ggcctcttgc tttgtatacc 1182 0 

acccattcca gtataaaagg ctacagattc 11880 

gatttgcact tggcaagagg ctcttggaat 1194 0 

ggcatttcag gaaccatggc aggtgggcgg 12000 

gatagggcac tttcagagct cctcctgacc 12060 

tcacctgggt gtcgaatccc tgacacttgg 12120 

tttgagcagg aaatggcaat agggcatcct 12180 

ctcttgggtt ctcccaattt gtcccataga 1224 0 

gtcacacagg agcagctcag ctcaatcctg 123 00 

gagggggcct tgggcccagt gacaggcaga 12 360 

ggcaagcatg aggggaggac ctagccccag 124 2 0 

atttttcacc tatagaagat gttgtgcctg 12480 

ctgtctggcc tcttcgtcat ctccaaggat 12540 

atctggagaa tgtaccccaa ctccacggtg 12600 

cagcgaaggt gctgccttgg atttggccaa 12660 

cctggagccc aggtgtgggc tgctgggccc 12720 

tgtggtggaa aggtgggtgc tgggctgctg 12780 

ttttatagat tgggaaatgg ggccagagac 12 840 

acttgacggc aaatccagga ccacagccta 12 900 

gcatggtgtg cctggtggcc cctgtgatcc 12 960 

ccaagtccac tttgttaccc tcatcttctg 13020 

attagtgggt ctttgactcc aagcctggtg 13080 

agcagaaggc agaccactgg gtgcctctgg 1314 0 

ttttggaagg ggcgggggaa cggagtctca 13200 

acaatcgagg ctcactgtaa cctccgcctc 13260 

tctcgagtag ctaggattac aggcacccac 13320 

gtatttttag tagagatggg gtttcaccat 13380 

tctagtgatc ctcccgcctc agcctcccaa 13440 

gcccacaagg gttctgacca tagtggtttg 13500 

acacgtgcat acacatgcac atcacatgca 13560 

tgcacacaca cacatataca tatgcttccc 13620 
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catcaaatgc atgtgatact atgagatgac 
ctgtaaaaca tgctccctca ctctgggact 

5 

tcagatgacc ctaaggctac cttcattctt 
aaacgttagg ccaatactca ccaaagaccc 
10 gaccaagaca gaccaagtcc ccgtcccaca 
tgggagctga aaccccacct gccctctgct 
aagcccacgt gatgcgtgca gacacccagc 

15 

cctctcaggt ccacatccag aggctggagc 
tcttcctgcc agtgcccaca gctgtcacag 
20 taagacaagg gcttgtcttc ccaggaagac 
tcagaaagga aagatatgtg ggggttacaa 
gctgagcgtt cactaagtga cagcccacga 

25 

gagaggaggc caatccaagg agaaaagggt 
cccagaacaa gaggctactg tgctgccagc 
30 aggcccatga tgaccgggca ggattcggac 
gcagagcaag aagagcacga gcactggctt 
tatggtgtgt gtggagtttg gctgtagtgg 

35 

catttaagga attgcaggga ctctattaag 
tggatggatg atgtatggat ggatatagac 
40 ttaactttat cttctcaata accctttttc 



ggctggagta 


cagtggcacg 


atctcggccc 


attctcctac 


ctcagcctcc 


caagtagctg 


taatttttgt 


atttttttaa 


gtatagatga 


aactcctgac 


ctcaagtgat 


ccaccaccca 


tcagccatca 


tgcccagccc 


tcaacaaccc 


ctagcaagct 


gaggtccaga 


gaggtcacat 


aatccaggac 


tcagcctgag 


ctatgtccaa 


gatggcatag 


ggctagatat 


attttaaaat 


cctgaacagt 


tcaggataag 


gatgttttat 


ccctgaccag 


aagtgatctc 


taaggctcat 



tcccagctgc 


acggctcttc 


ccctcctctg 


13680 


aggtaccccc 


atctgtcaaa 


taggcttgga 


13740 


accagggacc 


catgagattc 


tccagccctc 


13800 


atatgtgcca 


ggcattgagt 


gtagggcagt 


13860 


gagacccacc 


ctgccttccc 


cacgcacacc 


13920 


gtggatctta 


gcaaggaccc 


ctgcctggct 


13980 


gaggcctcac 


cctgccttcc 


ttccctctac 


14040 


tggccttgct 


ctgcttcact 


gtcctagagc 


14100 


cctggagagg 


ggactgccca 


tctgcaaagg 


14160 


aaaaaatggg 


ggcatgaggc 


agcagagagc 


14220 


gaggagcgtg 


tttcattcat 


tcctcacgtt 


14280 


tccagtaggg 


gagccaaggc 


acgggggcaa 


14340 


gaggctgcca 


gagaggaaac 


tccatgggtg 


14400 


tctgaaggcc 


ctggggaaat 


tgcatcacag 


14460 


ccacaggcat 


aggtggaaga 


gagactttca 


14520 


ggagacagga 


cagagaaggc 


ctggggaagc 


14580 


aaacaagtga 


cagagtcatc 


aaagattggg 


14640 


catcaatcga 


tgcatccatt 


gatggatgga 


14700 


gtgcagatat 


acagataaat 


atagctgtat 


14760 


ttttgagact 


gagtctcact 


ctgtcaccca 


14820 


actgcaacct 


ccgcctcccg 


ggttcaagca 


14880 


ggactacagg 


catgtgccac 


catacctggc 


14940 


ggttttgcca 


tcctggccag 


gctggtctca 


15000 


tggcctccca 


aagtgctggg 


attacaggca 


15060 


tttgaggtac 


ttacattctc 


atactacaga 


15120 


cacttctcaa 


ggcgaagcag 


ccaattagga 


15180 


cccacagtct 


tgcccttgtc 


tacccatggg 


15240 


agaaggcaag 


tctctgctca 


ccccaaagat 


15300 


aaaagaaaaa 


aaataaataa 


aaaataaaga 


15360 


ttctttaaag 


aaaaaaaaat 


catattacaa 


15420 
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tctcaacttt agaacattat ttaaaaaagt 

cactgatttt catgttagtg acctcatcac 

5 gttgatggcc tctgagtttc atttaacatc 

gctgcctgac aggaagtctt ctcattaaga 

actgaggcca accccacttt taaggatggg 

10 

tatgagcaat aattgttcct atttattgac 

ctgcaggtgg ctgtattcat tttctattga 

15 cttaaaatga cacctgtatt accctacagc 

aagggctcaa atcaaggtgt gtcgccagga 

tctctttcct tgccttttcc agcttctaga 

20 - 

ttcttccacc ttcaaagcca gcatcacaac 

cctttaagga cccttatcat catactgggc 

25 tctcaaagct gattggaaac cttaattcca 

accagagatt agaacatggg tgtcttctgg 

agatgtcctt gaatctcacg gccattctgg 

30 

ggaaaaaatg aaccttagaa gagttacgtg 

ccatgccagg ttccccaatg ggccctgtgc 

35 caggctctcc agcactgagt tggtgccacc 

agcaatcatt ggcagacact gttggcatta 

agggacttgg gtcatggccc agcccccctc 

40 

ctggtcacca ggacactggt agcgggacgg 

tgggcctccc cctactccct tttcaagaac 

45 tatggaagtc gcttcgtctc tcaagcctca 

ttcagaagtg aactagcaga gaagtgtttc 

cagcaggggc ggcgactttt ctttcgcaga 

50 

cattgcatct caaaggcctg ccggtggagc 

gcgacgcaca acacaagcaa catcagaggt 

55 tcagaaccca cgcttggccc cttctccctg 

tgtaaggggc aaggggctaa cagtgctgca 

caaggcagca atggtcctat cccctcaggc 

60 

catctcagca cctgacttgg ttgaccagaa 
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gccaaatccc 


ccataattct 


tcaagtttga 


15480 


acaccaagtg 


ttcacataat 


gcatatataa 


15540 


aagcagagaa 


ggtgtctttg 


ttaattccta 


15600 


catcagaaag 


ccagatgggg 


gtcttgagtc 


15660 


gaaaactggg 


gccccatacc 


tagctgcagc 


15720 


ctttgctgtg 


ttccaggcac 


tgtgctagtg 


15780 


tgttgtagta 


aattatcaca 


aatttggtgg 


15840 


tctggaggtc 


agcggtttga 


aatgagtctc 


15900 


ctgcatccct 


tccgggggca 


tcaaaaagaa 


15960 


ggctgcctgc 


attccttggc 


ttgtggctcc 


16020 


atcctccaat 


aaccctgtct 


ccctcttcta 


16080 


ccagagagat 


aatccaggag 


aatctcccaa 


16140 


tctgcaacct 


taattcctct 


ctgccgtgtg 


16200 


gggccgtgct 


tctccctact 


gcactagcta 


16260 


gacgcaggtt 


ctgcttttcc 


attttaaaga 


16320 


acttcccatg 


gtctcacagc 


cagtaagtgg 


16380 


cctccccatc 


ggtaaggatg 


gtgccaggga 


16440 


ctcatcacta 


atcagggatg 


gaggtgtggc 


16500 


gacctgcccc 


tagagtgtat 


cctggctaga 


16560 


aggaaaccca 


acagggtgag 


tgacttgcac 


16620 


gaaccagagc 


cccatccagt 


gactcccatg 


16680 


agggagggtg 


gagggagtca 


cacactggtc 


16740 


gtttccccac 


attcatggag 


aacttgagtc 


16800 


ggaagcagcc 


agtcgctcca 


cagctgtgag 


16860 


atgatgatgc 


atgccttgtt 


ccgaatacac 


16920 


ccccgccatc 


ctaccagagt 


gtgattcaag 


16980 


gaagaggttt 


ggccctggct 


ccccagacct 


17040 


gcatagatag 


aaaccagttc 


ccaggggagc 


17100 


gctagaactg 


cctggcctgg 


gagtggtagc 


17160 


actagaggaa 


ctcccagggc 


cccccaggaa 


17220 


ctacctcagc 


tgaacctggg 


cgcaaccagg 


17280 
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acccatcgcc ttccctgcca gcctccagag 
cccccatgcc ctctggcctc ccttgcctgc 

5 

gatataaggg cagccaggaa ggaaggtctc 
ctcaacctac taagacatct ctgagggagc 
10 agggcagctg gggagttgag tcccatagaa 
ccctctgcat gaatcttttt gccctcctgg 
tgcaaggatg acatgatcca ggagtctccc 

15 

aaggttcaca gttgtgacat cgaggttgcc 
tacttgtgct aagtaaataa tgtgagggag 
20 cagtcatctg aggaatagtc ttcaggactt 
gttttttttc cacactgatt cttttttcta 
aagatatagg aaatcagtat ctttccataa 

25 

gtgaaataaa acgcaa.tttt cccctccaac 
cttgctagct ctgccccagc caccatggct 
30 acccctgcct ctgagccatc acccttgccc 
atttgcagag ctcactcttt gaatctttac 
tatttaatct caatgtcatt taatcctccc 

35 

taatatctcc atttcatgca ggctcagaga 
agtcactgac ggagctgcga tctggaccca 
40 atgcccaaac ttggttggag catagcccct 
gatgagattt gcacatacaa aaggctagag 
cagctcctgg gaacgctgtc ctctcagata 

45 

atctgtggtg gcccagattg ttttgttttg 
ttgagatgga gtctcgctct gttgcccagc 
50 tgcaatctcc gcctccctgg ttcaagtgat 
actataggca cacgccacca cgcccagcta 
caccatattg gtcagcctgg tctcaaactc 

55 

tcccaaagtg ctgggattac aggcgtgagc 
ccgagatagc agtctgctct ctactgcctt 
60 gtatcatttg cccttgcaca gaatatccct 
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caaacccctg 


ggcctgcagc 


cccctggtgc 


17340 


cttctctgat 


caatggtgga 


tgcacagggg 


17400 


cagagcggca 


caggcagggg 


cttcccttgc 


17460 


ctgtgtgtct 


gaatccctgg 


atttgggctg 


17520 


actgtgtaca 


gttcttccct 


ctgaactaga 


17580 


acctcagtgt 


ctccatctgt 


aaagtggagg 


17640 


agcttgaaag 


ctctgagatc 


aggagaattc 


17700 


agtaaaactt 


cagcctcgga 


atttcaaacc 


17760 


agggacctgc 


cttcctcatt 


ttcctaattt 


17820 


ttgcttattt 


gtgagctgca 


atgtattctg 


17880 


tattatttta 


aaggggcact 


ttgtagcagc 


17940 


acagaagcaa 


accccaaaaa 


taacaaacat 


18000 


cccgtggtct 


cacccactgc 


ttcccttcct 


18060 


tccttgaggt 


tccacagaca 


ccccagggac 


18120 


ttcctccacc 


tggattctct 


tcccccaggt 


18180 


tgggccactg 


actctgagcc 


aagtaaatgc 


18240 


atcgatctgt 


gacctagctg 


ctgttgctat 


18300 


agttaagcaa 


gttgccccgg 


acacatggat 


18360 


gactgcaaac 


tgaagagcca 


ctgcctgaca 


18420 


gctctcccaa 


agttgcactt 


tcactgggaa 


18480 


cgatggtcta 


tacagcaaag 


tcagccctca 


18540 


agccatttct 


tacatagttg 


atggctcgat 


18600 


tttcgctttg 


ttttgttttc 


ttttgttttg 


18660 


ccagagtaca 


gtggcactat 


ctcagctcac 


18720 


tctcctatct 


cagcctccca 


agtagctggg 


18780 


atttttatat 


ttttagtaga 


gacaggattg 


18840 


ctgacctcag 


gtgatccacc 


tgcctcagcc 


18900 


cactgtgccc 


agcccaggtt 


ttgaagttgt 


18960 


ataaaatccc 


tgtgtgaagg 


gatgctctca 


19020 


ggggtttgag 


gttctttgaa 


ttctccctct 


19080 
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ttgtcatctc tttcgctgcc acttctggct gtggtcacta gcttggccat agcacctctc 19140 

ttctccactt ctgatctgct gcttctaacc ttctatagat tgcagctggc tttaaaatag 19200 

attgtaaagt gtaaggcatt aggttctgag acagcggcag agagagccat gcaaatgttt 19260 

aggacaaccc agtctttctt tttttttttt tttttttttt ttt 19303 
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As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 
payment of any additional fee. 

As only some of the required additional search fees were timely paid by the applicant, this international search 
report covers only those claims for which fees were paid, specifically claims Nos. : 



4. |/\| No required additional search fees were timely paid by the applicant. Consequently, this international search report 
is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 1 , SEQ ID NO: 1 

Remark on Protest \~\ The additional search fees were accompanied by the applicant's protest. 

• 1 1 No protest accompanied the payment of additional search fees. 
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BOX H. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING 

This application contains the following inventions or groups of inventions which are not so linked as to form a single general 
inventive concept under PCT Rule 13. 1. In order for all inventions to be examined, the appropriate additional examination fees must 
be paid. 

1-38. Claim l r drawn to an isolated MS4a polypeptide, or functional portion thereof, comprising a polypeptide encoded by the 
nucleotide sequence of one specific SEQ ID NOs :l-38. 

39-76. Claim 2-7, drawn to an isolated nucleic acid molecule encoding a MS4A polypeptide, comprising the nucleotide sequence of 
one specific odd-numbered SEQ ID NOs: 1-37 chimeric gene, vector and a host cell. 

77-1 15. Claim 8, drawn to a methods of deleting a nucleic acid molecule that encodes a MS4A polypeptide, the methods comprising 
hybridizing the nucleic acid molecule comprising the nucleotide sequence of one specific odd-numbered SEQ ID NOs: 1-37 to the 
biological sample. 

116-154. Claim 9, drawn to an antibody that specifically recognizes a MS4A polypeptide comprising a polypeptide encoded by the 
nucleotide sequence of one specific SEQ ID NOs : 1-38. 

155. Claim 10, drawn to a method for producing an antibody that specifically recognizes a MS4A polypeptide. 

156. Claim 11, drawn to a method for detecting a level of MS4A polypeptide. 

157-195. Claim 12, drawn to a methods for identifying a substance that modulates MS4A function, the method comprising isolating a 
MS4A polypeptide comprising a polypeptide encoded by the nucleotide sequence of one specific SEQ ID NOs : 1-38. 

196-234. Claims 13-16, drawn to a method for modulating MS4A function in a subject, the method comprising preparing a 
pharmaceutical composition, comprising a substance identified according to a methods for identifying a substance that modulates 
MS4A function, the method comprising isolating a MS4A polypeptide comprising a polypeptide encoded by the nucleotide sequence 
of one specific SEQ ID NOs :l-38. 

235. Claim 17 and 18 drawn to a method for identifying a candidate compound as a modulator of MS4A gene expression. 

236. Claim 19, drawn to a method for modulating MS4A function in a subject, the method comprising preparing a gene therapy 
vector. 



This International Searching Authority considers that the international application does not comply with the requirements of unity of 
the invention (Rules 13. 1 , 13.2 and 13.3) for the reasons indicated below: 

The invention listed as a Group 1-236 do not relate to a single general inventive concept under PCT Rule 13.1 because, under PCT 
Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 

The special technical feature of Groups 1-38 is considered an isolated MS4A polypeptide, or functional portion thereof, comprising a 
polypeptide encoded by the nucleotide sequence of one specific SEQ ID NOs : 1-38. 

The special technical feature of Group 39-76 is considered an isolated nucleic acid molecule encoding a MS4A polypeptide, 
comprising the nucleotide sequence of one specific odd-numbered SEQ ID NOs: 1-37 chimeric gene, vector and a host cell! 
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The special technical feature of Groups 77-1 15 is considered a methods of detecting a nucleic acid molecule that encodes a MS4A 
polypeptide, the methods comprising hybridizing the nucleic acid molecule comprising the nucleotide sequence of one specific odd- 
numbered SEQ ID NOs: 1-37 to the biological sample. ^ 

The special technical feature of Groups 11 6- 154is considered an antibody that specifically recognizes a MS4A polypeptide 
comprising a polypeptide encoded by the nucleotide sequence of one specific SEQ ID NOs : 1-38. " 

The special technical feature of Group 155is considered a method for producing an antibody that specifically recognizes a MS4A 
polypeptide. 

The special technical feature of Group 156 is considered a method for detecting a level of MS4A polypeptide. 

The special technical feature of Group 157-195 is considered a methods for identifying a substance that modulates MS4A function the 
ED NOs °!^ aDg ******* a MS4A polypeptide comprising a polypeptide encoded by the nucleotide sequence of one specific SEQ 

The special technical feature of Group 196-234is considered a method for modulating MS4A function in a subject, the method 
comprising preparing a pharmaceutical composition, comprising a substance identified according to a methods for identifying a 
substance that modulates MS4A function, the method comprising isolating a MS4A polypeptide comprising a polypeptide encoded by 
the nucleotide sequence of one specific SEQ ID NOs: 1-38. 

The special technical feature of Group 235 is considered a method for identifying a candidate compound as a modulator of MS4A • 
gene expression. 

The special technical feature of Group 236 is considered a method for modulating MS4A function in a subject, the method 
comprising preparing a gene therapy vector. 

Accordingly, Groups 1-236 are not so linded by the same or a corresponding special technical feature within meaning of PCT Rule 
13.2 so as to form a single general inventive concept 
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